
Technical Field 

[0001] The present invention relates to novel G protein-coupled receptors and genes thereof, and production and
uses thereof. 

Background Art 

[0002] G protein-coupled receptor is a generic name for the group of cell membrane receptors that transduce signals
into cells via the activation of trimer-type GTP-binding proteins. The G protein-coupled receptor has the structural
characteristic of seven transmembrane domains in one molecule, and is thus also called "a seven-transmembrane receptor".
The G protein-coupled receptor transmits the information carried by various physiologically active substances into
cells across the cell membrane via the activation of the trimer-type GTP-binding protein and the resulting change in
intracellular second messengers. Well-known intracellular second messengers that are regulated by the
trimer-type GTP-binding protein include cAMP, mediated by adenylate cyclase, and Ca2+, mediated by phospholipase
C. Recent studies have shown that many types of intracellular proteins serve as targets of the trimer-type
GTP-binding protein; for example, the regulation of channels and the activation of phosphorylation enzymes are mediated by it
(Annu. Rev. Neurosci. (97) 20:399). There is a wide variety of substrates (ligands) for the G protein-coupled receptor,
for example, protein hormones, chemokines, peptides, amines, and substances derived from lipids; a protease such as
thrombin is also one such example. The number of human G protein-coupled receptors whose genes have recently been
identified is a little under 300, excluding the sensory-type receptors. However, the number of G protein-coupled
receptors for which ligands have been identified is only about 140. Thus, there are 100 or more
"orphan G protein-coupled receptors" whose ligands are unknown. The human genome has been assumed to contain at least 400 types,
and possibly up to 1000 types, of G protein-coupled receptors (Trends Pharmacol. Sci. (97) 18:430). This means that
the number of functionally unknown orphan G protein-coupled receptors may increase explosively with the rapid
progress of genome analysis.

[0003] Ninety percent or more of the drugs so far produced by pharmaceutical companies worldwide target
interactions in extracellular spaces, and low-molecular-weight drugs comprise the majority of those relating to
G protein-coupled receptors. The reason is that G protein-coupled receptor-related diseases include many types
of diseases, such as those of the cerebral nervous system, circulatory system, digestive system, immune system,
locomotor system, urinary system, and genital system, including genetic diseases. Thus, in recent years, many
pharmaceutical companies have retained the orphan G protein-coupled receptors they found through genome analysis, and are
competing fiercely with each other to reveal their ligands and physiological functions. Against this background, successful cases
of physiological screening of ligands for some novel G protein-coupled receptors have recently begun to be reported.
For example, the cases of a calcitonin-related peptide receptor (J. Biol. Chem. (96) 271:11325), orexin (Cell (98) 92:573),
and prolactin-releasing peptide (Nature (98) 393:272) had a great impact on basic studies in the field of life
science.

[0004] In particular, the orphan G protein-coupled receptors have attracted attention as potential new targets
for drug development. In general, since no specific ligands are known for the orphan G protein-coupled
receptors, it has been difficult to develop agonists or antagonists. However, in recent years, the creation of orphan
G protein-coupled receptor-targeted drugs by combining enriched compound libraries with high-throughput screening
methods has been proposed (Trends Pharmacol. Sci. (97) 18:430, Br. J. Pharm. (98) 125:1387). Specifically,
the creation comprises identifying physiological agonists of an orphan G protein-coupled receptor identified by genetic
engineering, by functional screening that uses alterations in the level of an intracellular messenger, cAMP or Ca2+, as
an index, and then analyzing the in vivo functions. In this method, high-throughput screening using a
compound library theoretically allows the discovery of surrogate agonists and antagonists specific to the orphan
G protein-coupled receptor and, further, the development of therapeutic agents for particular diseases.

Disclosure of the Invention

[0005] The present invention was achieved in view of the present situation surrounding G protein-coupled receptors,
and an objective thereof is to provide novel G protein-coupled receptors and their genes, and methods for
producing and using them. Another objective of the present invention is to provide these molecules as targets for the
study of drug development.

[0006] The present inventors studied strenuously to achieve the above-mentioned objectives, and successfully
isolated nine novel genes comprising nucleotide sequences encoding hydrophobic regions considered to be seven
transmembrane domains, which are characteristic of the G protein-coupled receptors, by polymerase chain reaction using
cDNAs from human tissues as templates. These genes and the proteins that are their translation products can be used in
screening for ligands and for agonists or antagonists useful as pharmaceuticals, or can be used for diagnosing diseases
relating to these genes.

[0007] Thus, the present invention relates to novel G protein-coupled receptors and the genes encoding them, and
the uses and production thereof. More specifically, the present invention provides:

(1) a DNA that encodes a guanosine triphosphate-binding protein-coupled receptor, wherein said DNA is selected 
from the group consisting of the following (a) to (d): 

(a) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17
to 21;

(b) a DNA comprising a coding region of the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26;

(c) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17
to 21 in which one or more amino acids are substituted, deleted, added, and/or inserted; and

(d) a DNA hybridizing under stringent conditions to the DNA comprising the nucleotide sequence of any one
of SEQ ID NOs: 5 to 8 and 22 to 26;

(2) a DNA encoding a partial peptide of a protein comprising the amino acid sequence of any one of SEQ ID NOs: 
1 to 4 and 17 to 21; 

(3) a vector comprising the DNA of any one of (1) and (2);

(4) a transformant carrying the DNA of any one of (1) and (2) or the vector of (3);

(5) a protein or a peptide encoded by the DNA of any one of (1) and (2);

(6) a method for producing the protein or the peptide of (5), said method comprising the steps of culturing the
transformant of (4) and recovering an expressed protein or peptide from the transformant or culture supernatant
thereof;

(7) a method of screening for ligands that bind to the protein of (5), said method comprising the steps of: 

(a) contacting a test sample with the protein or the peptide of (5); and 

(b) selecting compounds that bind to said protein or said peptide;

(8) a method of screening for compounds that have activity of inhibiting the binding between the protein of (5) and
a ligand thereof, said method comprising the steps of:

(a) contacting the protein of (5) or a partial peptide thereof with the ligand in the presence of a test sample
and detecting a binding activity of said protein or said partial peptide with said ligand; and

(b) selecting compounds that reduce the binding activity detected in step (a) as compared with a binding
activity detected in the absence of the test sample;

(9) a method of screening for compounds that inhibit or enhance activity of the protein of (5), said method comprising
the steps of:

(a) contacting a ligand of said protein with cells expressing said protein in the presence of a test sample, 

(b) detecting an alteration in the cells that results from binding of said ligand to said protein, and 

(c) selecting compounds that suppress or enhance the alteration detected in step (b) as compared with an 
alteration detected in the cells in the absence of the test sample;

(10) the method of (8) or (9), wherein the alteration in cells is a change in cAMP concentration or calcium
concentration;

(11) an antibody binding to the protein of (5);

(12) a compound isolated by the method of any one of (7) to (10);

(13) a pharmaceutical composition comprising the compound of (12) as an active ingredient; 

(14) the pharmaceutical composition of (13), wherein said pharmaceutical composition is formulated for the treat- 
ment of a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease; 

(15) a polynucleotide comprising at least 15 nucleotides, wherein said polynucleotide is complementary to the
DNA comprising the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26 or a complementary
strand thereof;

(16) a method for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's
disease, said method comprising the steps of detecting expression of the DNA of (1) in tissues related to the
disease derived from a subject, or mutation in the DNA of (1) in the subject; and

(17) an agent for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's
disease, said agent comprising the antibody of (11) or the polynucleotide of (15).
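
The selection step of screening method (8) can be sketched in code. The following is only a minimal illustration under stated assumptions, not the procedure of the invention: the function name, the sample names, and the `min_reduction` cutoff are hypothetical, and the binding activity is assumed to be reported as a single positive number per test sample.

```python
from typing import Dict, List


def select_binding_inhibitors(
    binding_with_sample: Dict[str, float],
    binding_without_sample: float,
    min_reduction: float = 0.5,  # hypothetical cutoff, not taken from the text
) -> List[str]:
    """Return test samples that reduce ligand binding relative to the control.

    binding_with_sample maps a test sample name to the binding activity
    detected in its presence; binding_without_sample is the activity detected
    in the absence of any test sample.
    """
    if binding_without_sample <= 0:
        raise ValueError("control binding activity must be positive")
    hits = []
    for name, signal in binding_with_sample.items():
        # Fractional reduction of binding compared with the control.
        reduction = 1.0 - signal / binding_without_sample
        if reduction >= min_reduction:
            hits.append(name)
    return sorted(hits)
```

With a control signal of 1000 and signals of 200, 950, and 500 for three hypothetical samples, the first and third would be selected at the default cutoff.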

[0008] As used herein, the term "G protein-coupled receptor" means a cell membrane receptor transducing signals
into cells via the activation of the GTP-binding protein.

[0009] As used herein, the term "ligand" means a physiological substance binding to the G protein-coupled receptor
and transducing signals into cells. Herein, the term "physiological substance" means a compound bound to the G
protein-coupled receptor in vivo.

[0010] As used herein, the term "agonist" means a compound capable of binding to the G protein-coupled receptor
and transducing signals into cells, including biological substances, artificially synthesized compounds, and naturally
occurring compounds.

[0011] As used herein, the term "antagonist" means a compound inhibiting the binding of a ligand to the G protein-coupled
receptor or inhibiting the signal transduction into cells, including biological substances, artificially synthesized
compounds, and naturally occurring compounds.

[0012] The present invention provides novel G protein-coupled receptors and the DNAs encoding the proteins. The
nine human cDNA clones isolated by the present inventors and included in the present invention were named
"GPRv8", "GPRv12", "GPRv16", "GPRv21", "GPRv40", "GPRv47", "GPRv51", "GPRv71", and "GPRv72" (as required,
these clones are collectively referred to as "GPRv"). The nucleotide sequences of the cDNAs are shown in SEQ ID
NOs: 5 to 8 and 22 to 26; the amino acid sequences of the proteins encoded by the cDNAs are shown in SEQ ID NOs:
1 to 4 and 17 to 21.

[0013] A BLAST search showed that the amino acid sequences of all the proteins encoded by the GPRv
cDNAs exhibited significant homology to those of known G protein-coupled receptors. Specifically, "GPRv8" exhibited
36% homology to HUMAN VASOPRESSIN V1B RECEPTOR (P47901, 424 aa); "GPRv12" exhibited 27% homology
to RAT 5-HYDROXYTRYPTAMINE 6 RECEPTOR (P31388, 436 aa); "GPRv16" exhibited 28% homology to MOUSE
GALANIN RECEPTOR TYPE 1 (P56479, 348 aa); "GPRv21" exhibited 30% homology to BOVIN NEUROPEPTIDE Y
RECEPTOR TYPE 2 (P79113, 384 aa); "GPRv40" exhibited 34% homology to OXYTOCIN RECEPTOR (P97926, 388
aa); "GPRv47" exhibited 43% homology to GPRX_ORYLA PROBABLE G PROTEIN-COUPLED RECEPTOR (Q91178,
428 aa); "GPRv51" exhibited 37% homology to PROBABLE G PROTEIN-COUPLED RECEPTOR RTA (P23749, 343
aa); "GPRv71" exhibited 45% homology to Chicken P2Y PURINOCEPTOR 3 (P2Y3) (Q98907, 328 aa); and "GPRv72"
exhibited 30% homology to ALPHA-1A ADRENERGIC RECEPTOR (O02824, 466 aa).
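
For convenience, the homology values listed above can be held in a small data structure and ranked. This is merely an illustrative sketch; the variable and function names are hypothetical, and the percentages are copied from the text rather than recomputed.

```python
# (clone, best-matching known receptor, percent homology) as listed in the text.
BLAST_HITS = [
    ("GPRv8", "HUMAN VASOPRESSIN V1B RECEPTOR", 36),
    ("GPRv12", "RAT 5-HYDROXYTRYPTAMINE 6 RECEPTOR", 27),
    ("GPRv16", "MOUSE GALANIN RECEPTOR TYPE 1", 28),
    ("GPRv21", "BOVIN NEUROPEPTIDE Y RECEPTOR TYPE 2", 30),
    ("GPRv40", "OXYTOCIN RECEPTOR", 34),
    ("GPRv47", "GPRX_ORYLA PROBABLE G PROTEIN-COUPLED RECEPTOR", 43),
    ("GPRv51", "PROBABLE G PROTEIN-COUPLED RECEPTOR RTA", 37),
    ("GPRv71", "Chicken P2Y PURINOCEPTOR 3 (P2Y3)", 45),
    ("GPRv72", "ALPHA-1A ADRENERGIC RECEPTOR", 30),
]


def rank_by_homology(hits):
    """Return the hits sorted from highest to lowest percent homology."""
    return sorted(hits, key=lambda hit: hit[2], reverse=True)


if __name__ == "__main__":
    for clone, receptor, percent in rank_by_homology(BLAST_HITS):
        print(f"{clone}: {percent}% to {receptor}")
```
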
[0014] Further, all the proteins encoded by the GPRv cDNAs isolated by the present inventors (hereinafter also
referred to as "GPRv proteins") contained hydrophobic regions, which were assumed to correspond to the seven
transmembrane domains characteristic of the G protein-coupled receptor. Based on these findings, all the GPRv cDNAs
can be considered to encode proteins belonging to the G protein-coupled receptor family. The G protein-coupled
receptors have the activity of transducing signals into cells via the activation of the G protein, which is mediated by the
ligand. As described above, the receptors are involved in many types of diseases, such as those of the cerebral nervous
system, circulatory system, digestive system, immune system, locomotor system, urinary system, and genital system,
including genetic diseases. Accordingly, the GPRv proteins can be used to screen for agonists and antagonists
regulating the functions of GPRv proteins, and are thus important targets of drug development for the above diseases.
[0015] The present invention also provides proteins functionally equivalent to the GPRv proteins. As used herein,
the term "functionally equivalent" means that a protein of interest has biological properties identical to those of the
GPRv proteins. The biological properties of GPRv proteins include the activity of transducing signals into cells via the
activation of the trimer-type GTP-binding protein. According to the type of intracellular signal transduction
system that is activated, the trimer-type GTP-binding proteins are categorized into three classes, namely, the Gq type, which increases
the Ca2+ level, the Gs type, which increases the cAMP level, and the Gi type, which reduces the cAMP level (Trends Pharmacol.
Sci. (99) 20:118). Thus, whether the protein of interest has biological properties identical to those
of GPRv proteins can be assessed, for example, by detecting activation-dependent changes in the concentration of cAMP
or calcium in cells.
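
The three-class categorization described above can be expressed as a simple decision rule. The sketch below is an illustration under assumptions: the function name and the 1.5-fold threshold are hypothetical, and the inputs are assumed to be fold changes relative to unstimulated cells.

```python
from typing import List


def classify_g_protein_coupling(
    camp_fold_change: float,
    calcium_fold_change: float,
    threshold: float = 1.5,  # hypothetical cutoff, not taken from the text
) -> List[str]:
    """Suggest which G protein classes are consistent with the observed changes.

    Gq: Ca2+ level increases; Gs: cAMP level increases; Gi: cAMP level decreases.
    Fold changes are measured relative to cells without receptor activation.
    """
    classes = []
    if calcium_fold_change >= threshold:
        classes.append("Gq")
    if camp_fold_change >= threshold:
        classes.append("Gs")
    elif camp_fold_change <= 1.0 / threshold:
        classes.append("Gi")
    return classes
```

An empty list indicates that neither messenger changed beyond the chosen threshold, so no class can be suggested from these two readouts alone.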

[0016] In an embodiment, the method for preparing a protein functionally equivalent to the GPRv protein includes a
method of introducing mutations into the amino acid sequence of the protein. Such methods include, for example,
site-directed mutagenesis (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons
Section 8.1-8.5). Amino acid mutations in the protein can also occur naturally. The present invention includes
mutant proteins, regardless of whether they are generated artificially or naturally, in which one or more amino acids have been
substituted, deleted, inserted, and/or added in the amino acid sequences (SEQ ID NOs: 1 to 4 and 17 to 21) of the GPRv
proteins, provided that the mutant proteins are functionally equivalent to the GPRv proteins. There is no limitation on the number
of amino acid mutations or the positions of the mutations in the proteins, as long as the functions of the GPRv proteins are
retained. The number of mutations is assumed to be typically within 10% of all the amino acids, preferably within
5% of all the amino acids, and further preferably within 1% of all the amino acids.
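
The 10%, 5%, and 1% limits can be checked against a mutant sequence as sketched below. This is a simplified illustration: the function names are hypothetical, and only substitutions between two sequences of equal length are counted (insertions, deletions, and additions would require an alignment first).

```python
def count_substitutions(reference: str, mutant: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(reference) != len(mutant):
        raise ValueError("sequences must have equal length for this sketch")
    return sum(1 for a, b in zip(reference, mutant) if a != b)


def mutation_tier(reference: str, mutant: str) -> str:
    """Report the tightest of the 1%, 5%, and 10% limits that the mutant meets."""
    if not reference:
        raise ValueError("reference sequence must not be empty")
    mutations = count_substitutions(reference, mutant)
    length = len(reference)
    for percent in (1, 5, 10):
        # Integer arithmetic avoids floating-point rounding at the boundary.
        if mutations * 100 <= percent * length:
            return f"within {percent}%"
    return "above 10%"
```

For a 100-residue protein, for example, one substitution falls within 1%, five within 5%, and ten within 10% of all the amino acids.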

[0017] In another embodiment of the invention, the method for preparing a protein functionally equivalent to the
GPRv protein includes a method using the hybridization technique or the gene amplification technique. Specifically, those
skilled in the art can typically isolate a DNA having high homology to the DNA sequence encoding the GPRv protein
(SEQ ID NOs: 5 to 8 and 22 to 26) or a partial sequence thereof from a DNA sample derived from a homologous or
heterologous organism using the hybridization technique (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987)
Publish. John Wiley & Sons Section 6.3-6.4) and then obtain a protein functionally equivalent to the GPRv protein.
Thus, the proteins of the present invention also include a protein encoded by a DNA capable of hybridizing to the DNA
encoding the GPRv protein, which is functionally equivalent to the GPRv protein.

[0018] Organisms to be used for isolating such a protein include, for example, rat, mouse, rabbit, chicken, pig, cattle,
and so forth, in addition to human, but are not limited thereto.

[0019] Typical stringent hybridization conditions for isolating a DNA encoding a protein functionally equivalent to the
GPRv protein are those of "1x SSC, 0.1% SDS, 37°C" or the like; more stringently, those of "0.5x SSC, 0.1% SDS,
42°C" or the like; much more stringently, those of "0.2x SSC, 0.1% SDS, 65°C" or the like. As the hybridization conditions
become more stringent, a DNA with higher homology to the probe sequence can be expected to be isolated. However,
the above combinations of SSC, SDS, and temperature are only examples, and those skilled in the art can achieve
stringencies equivalent to the above by appropriately combining the above or other factors determining the hybridization
stringency (for example, probe concentration, probe length, time of hybridization reaction, and so forth).
[0020] The protein encoded by a DNA isolated by using such a hybridization technique typically has high amino acid
sequence homology to the GPRv protein. The term "high homology" means a degree of sequence
homology of at least 40%, preferably 60% or higher, further preferably 80% or higher (for example, 90% or
higher, or 95% or higher).
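
The homology thresholds above can be applied to a pair of sequences as sketched below. This is a minimal illustration, not the BLAST calculation itself: the function names are hypothetical, the two sequences are assumed to be already aligned with "-" as the gap character, and identity is computed over all alignment columns that are not gaps in both sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns with identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue  # a column that is a gap in both sequences carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def homology_tier(percent: float) -> str:
    """Map a percentage onto the thresholds named in the text."""
    for cutoff in (95, 90, 80, 60, 40):
        if percent >= cutoff:
            return f"{cutoff}% or higher"
    return "below 40%"
```

A program such as BLAST reports identity over its own local alignment, so values from this sketch are comparable only when the same alignment is used.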

[0021] Identity of amino acid sequences or nucleotide sequences can be determined with the BLAST algorithm of Karlin
and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1993). Based on this algorithm, the programs BLASTN and
BLASTX have been developed (Altschul et al. J. Mol. Biol. 215: 403-410, 1990). When nucleotide sequences are
analyzed by BLASTN based on BLAST, the parameters are set, for example, as follows: score = 100; and wordlength =
12. Alternatively, when amino acid sequences are analyzed by BLASTX based on BLAST, the parameters are set, for
example, as follows: score = 50; and wordlength = 3. When BLAST and the Gapped BLAST program are used for the
analysis, the default parameters are used in each program. The specific techniques used in these analysis methods
are already known (http://www.ncbi.nlm.nih.gov.).

[0022] Further, primers can be designed based on a part of the DNA sequence (SEQ ID NOs: 5 to 8 and 22 to 26)
encoding the GPRv protein, a DNA fragment having high homology to the DNA sequence encoding the GPRv protein
can be isolated by the gene amplification technique (PCR) (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987)
Publish. John Wiley & Sons Section 6.1-6.4), and then the protein functionally equivalent to the GPRv protein can be
obtained.

[0023] The present invention also includes partial peptides of the protein of the present invention. These partial 
peptides include peptides binding to the ligand but not transducing signals. An affinity column prepared using such a 
peptide can be used suitably for ligand screening. In addition, the partial peptides of the protein of the present invention 
can be used for preparing antibodies. The partial peptides of the present invention can be produced, for example, by 

using genetic engineering techniques, known peptide synthetic methods, or methods of digesting the protein of the
present invention with an appropriate peptidase. The partial peptides of the present invention typically consist of 8 or
more amino acid residues, preferably 12 or more amino acid residues (for example, 15 or more amino acid residues). 
[0024] The protein of the present invention can be prepared as a recombinant protein or a natural protein. The
recombinant protein can be prepared, for example, by introducing a DNA encoding the protein of the present
invention, which has been inserted in a vector, into an appropriate host cell and purifying the protein expressed in the
transformant. On the other hand, the natural protein can be prepared, for example, by using an affinity column in
which an antibody against the protein of the present invention has been immobilized (Current Protocols in
Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section 16.1-16.19). The antibody to be used
in the affinity purification may be a polyclonal or monoclonal antibody. Further, the protein of the present invention can
be prepared by in vitro translation (see, for example, "On the fidelity of mRNA translation in the nuclease-treated rabbit
reticulocyte lysate system. Dasso, M.C., Jackson, R.J. (1989) NAR 17:3129-3144"), or the like.
[0025] The present invention also provides DNAs encoding the above-mentioned proteins of the present invention.
There is no limitation on the type of DNA of the present invention, as long as it can encode the protein of the present
invention; it includes cDNA, genomic DNA, chemically synthesized DNA, etc. Further, as long as it encodes the protein of
the present invention, a DNA having any nucleotide sequence based on the degeneracy of the genetic code is included.
The DNA of the present invention can be isolated according to a standard method, such as the hybridization method
using a DNA sequence encoding the GPRv protein (SEQ ID NOs: 5 to 8 and 22 to 26) or a partial sequence thereof
as a probe, or the PCR method using primers synthesized based on these DNA sequences, as described above.
[0026] In addition, the present invention also provides a vector into which the DNA of the present invention has been
inserted. There is no limitation on the type of vector of the present invention, as long as it stably retains the inserted
DNA. For example, when E. coli is used as a host, the preferable cloning vector is pBluescript vector (Stratagene) or
the like. When the vector is used for the purpose of producing the protein of the present invention, an expression vector
is especially useful. There is no limitation on the type of expression vector, as long as it directs the expression of the
protein in vitro, in E. coli, in cultured cells, or in the living body; for example, pBEST (Promega) for in vitro expression, pET
(Invitrogen) for expression in E. coli, pME18S-FL3 (GenBank Accession No. AB009864) for expression in cultured cells, and
pME18S (Mol Cell Biol. 8:466-472 (1988)) for expression in the living body of an organism are preferred vectors. The insertion of the DNA of the
present invention into a vector can be achieved according to a standard method, for example, by ligation using restriction
enzyme sites (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section
11.4-11.11).

[0027] Also, the present invention provides a transformant containing the DNA of the present invention or the vector 
of the present invention. There is no limitation on the type of host cell into which the vector of the present invention is 
to be introduced, and various types of host cells can be used depending on the purposes. Exemplary eukaryotic cells, 
in which the protein is to be expressed at high levels, include COS cells and CHO cells. The vector can be introduced
into the host cell by a known method such as, for example, the calcium-phosphate precipitation method, the electroporation
method (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. Section
9.1-9.9), a method with lipofectamine (GIBCO-BRL), microinjection, and so forth.

[0028] The present invention also provides nucleotides comprising at least 15 nucleotide residues, which are
complementary to the DNA encoding the protein of the present invention (DNA comprising any one of the nucleotide sequences
of SEQ ID NOs: 5 to 8 and 22 to 26 or the complementary strand thereof). The term "complementary strand" means
one strand of a double-stranded nucleic acid consisting of A:T (U in the case of RNA) and G:C nucleotide pairs that is
complementary to the other strand. Further, the term "complementary" means not only a perfectly complementary
sequence in a region of at least 16 consecutive nucleotide residues, but also nucleotide sequences with at
least 70% homology, preferably at least 80%, more preferably 90%, further preferably 95% homology or higher. The
algorithm described herein can be used for determining homology. These nucleotides can be used as probes for
detecting and isolating the DNA of the present invention, and as primers for amplifying the DNA of the present invention.
When used as a primer, the nucleotide typically comprises 15 bp to 100 bp, preferably 15 bp to 35 bp, of nucleotides. Alternatively,
when used as a probe, it is a nucleotide of at least 15 bp containing at least a part of the DNA of the present invention
or the entire sequence. Preferably, such a nucleotide specifically hybridizes to the DNA encoding the protein of the present
invention. The term "specifically hybridizing" means that a DNA hybridizes to the DNA encoding the protein of the
present invention (SEQ ID NOs: 5 to 8 and 22 to 26) but not to DNAs encoding other proteins, under typical hybridization
conditions, preferably under stringent conditions.
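
The notions of a complementary strand, primer length, and percent complementarity described above can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the sequences are assumed to contain only A, C, G, and T, and complementarity is computed position by position between an oligonucleotide and a target region of equal length, without alignment.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def is_primer_length_ok(primer: str, preferred: bool = False) -> bool:
    """Check the 15-100 bp range, or the preferred 15-35 bp range."""
    upper = 35 if preferred else 100
    return 15 <= len(primer) <= upper


def percent_complementarity(oligo: str, target_region: str) -> float:
    """Percent of positions at which the oligo pairs with the target region."""
    if not oligo or len(oligo) != len(target_region):
        raise ValueError("oligo and target region must be non-empty and equal in length")
    expected = reverse_complement(target_region).upper()
    matches = sum(1 for a, b in zip(oligo.upper(), expected) if a == b)
    return 100.0 * matches / len(oligo)
```

A perfectly complementary oligonucleotide scores 100%, and the 70%, 80%, 90%, and 95% levels named in the text can then be compared against the returned value.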

[0029] These nucleotides can be used for testing and diagnosing abnormalities of the protein of the present invention. 
For example, abnormal expression of the DNA encoding the protein of the present invention can be tested by Northern 
hybridization or RT-PCR using these nucleotides as probes or primers. The nucleotides can be used, for example, in 
tests for cancers, cirrhosis, or Alzheimer's disease. In addition, the DNA encoding the protein of the present invention
or the regulatory region for its expression can be amplified by polymerase chain reaction (PCR) using the nucleotides as
primers, and then abnormalities in the DNA sequence can be tested and diagnosed by using methods such as
RFLP analysis, SSCP, and sequencing. 

[0030] Moreover, the antisense DNA for suppressing expression of the protein of the present invention is included in
these nucleotides. In order to cause the antisense effect, the antisense DNA comprises at least 15 bp of nucleotides,
preferably 100 bp or more, more preferably 500 bp or more, and usually comprises 3000 bp or less, preferably 2000 bp
or less. Such antisense DNA may be applied to gene therapy for diseases resulting from abnormalities
(abnormalities of function or expression) of the protein of the present invention and so forth. This antisense DNA can be
prepared, for example, based on the sequence information of the DNA (for example, SEQ ID NOs: 5 to 8 and 22 to
26) encoding the protein of the present invention, by the phosphorothioate method (Stein, 1988 Physicochemical
properties of phosphorothioate oligodeoxynucleotides. Nucleic Acids Res 16, 3209-21 (1988)), etc.
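
The length ranges given for the antisense DNA can be summarized as a small check. The sketch below is an assumption-laden illustration: the function name and category labels are hypothetical, and combining the preferred lower bounds (100 bp, 500 bp) with the preferred upper bound (2000 bp) into single categories is an interpretation rather than something the text states.

```python
def antisense_length_category(length_bp: int) -> str:
    """Classify an antisense DNA length against the ranges named in the text."""
    if length_bp < 15 or length_bp > 3000:
        return "outside range"
    if 500 <= length_bp <= 2000:
        return "more preferred"
    if 100 <= length_bp <= 2000:
        return "preferred"
    return "acceptable"
```
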

[0031] For gene therapy, the nucleotide of the present invention can be administered to a patient by an ex vivo method,
an in vivo method, and so forth using virus vectors, such as a retrovirus vector, an adenovirus vector, and an
adeno-associated virus vector, or non-virus vectors, such as liposomes.

[0032] Further, the present invention provides an antibody that binds to the protein of the present invention. There is
no limitation on the form of the antibody of the present invention; a polyclonal antibody, a monoclonal antibody,
and a part thereof having antigen affinity are also included. Moreover, antibodies of all classes are included. Furthermore,
the antibody of the present invention also includes special antibodies, such as a humanized antibody.
[0033] For a polyclonal antibody, the antibody of the present invention can be obtained by synthesizing oligopeptides
corresponding to the amino acid sequence of the protein of the present invention according to a standard method, and then
immunizing rabbits with them (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons.
Section 11.12-11.13). For a monoclonal antibody, hybridoma cells are prepared by cell fusion of myeloma cells with spleen
cells of a mouse immunized with the protein, which has been expressed in E. coli and then purified according to a standard
method, and the antibody of the present invention can be obtained from these hybridoma cells (Current Protocols in
Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. Section 11.4-11.11).

[0034] In addition to purification of the protein of the present invention, the antibody that binds to the protein of the present
invention may also be used for testing and diagnosing abnormalities in the expression or structure of the protein
of the present invention. Specifically, proteins can be extracted from tissue, blood, or cells, and then tested
for the presence or absence of abnormalities in expression or structure through detection of the
protein of the present invention by Western blotting, immunoprecipitation, ELISA, and so forth. The antibody
of the present invention may be used for testing for cancer, liver cirrhosis, or Alzheimer's disease.

[0035] Moreover, the antibody that binds to the protein of the present invention may be used for purposes such
as treatment of diseases relevant to the protein of the present invention. The antibody of the present invention can
act as an agonist or antagonist of the protein of the present invention. When an antibody is used for the purpose
of treating a patient, a human-derived antibody or a humanized antibody is preferable because of its low
immunogenicity. A human-derived antibody can be prepared by immunizing a mouse whose immune system
has been replaced with that of human (for example, refer to "Functional transplant of megabase human immunoglobulin loci
recapitulates human antibody response in mice" Mendez, M.J. et al. (1997) Nat. Genet. 15: 146-156). Moreover, a
humanized antibody can be prepared by recombination with the hypervariable region of a monoclonal antibody
(Methods in Enzymology 203, 99-121 (1991)).

[0036] Further, the present invention also provides a method of screening for ligands that bind to the protein of the present
invention, using the protein of the present invention. This screening method comprises the steps of: (a) contacting a test
sample with the protein of the present invention or a partial peptide thereof; and (b) selecting compounds binding to
the protein or the partial peptide thereof.

[0037] The test sample includes, without limitation, known compounds or peptides (for example, those deposited
in the Chemical File) whose activities as ligands of the various G protein-coupled receptors have not yet been
identified, or a group of random peptides prepared by the phage display method (J. Mol. Biol. (1991) 222,
301-310). Further, culture supernatants of microorganisms, natural ingredients from plants or marine organisms,
and, in addition to these, biological extracts from tissues including brain, cell extracts, and expression products
of gene libraries can be screened, but the test sample is not limited thereto.
[0038] The protein of the present invention to be used for the screening can be, for example, in the form displayed
on the cell surface, in the form of a cell membrane fraction, or in the form immobilized in an affinity column.
[0039] Specific screening methods include many known methods such as, for example, a method of contacting a
test sample with an affinity column carrying the protein of the present invention and purifying the compounds bound
to the protein; and the Western blotting method. When these methods are used, the test sample is labeled
appropriately, and binding to the protein of the present invention can be detected by using the label. In addition
to these methods, another method can be used in which cell membranes expressing the protein of the present
invention are prepared and immobilized on a chip, and the alterations in surface plasmon resonance, which represent
the dissociation of the trimer-type GTP-binding protein during ligand binding, are detected (Nature Biotechnology
(99) 17: 1105).

[0040] Further, the binding activity between a test sample and the protein of the present invention can be detected
by using, as indices, alterations in cells that are caused by the binding of the test sample to the protein of the
present invention expressed on the cell surface. Such alterations include, for example, alterations of the
intracellular Ca2+ level and cAMP level, but are not limited thereto. Specifically, the agonist activity toward the
G protein-coupled receptor can be assayed by the GTPγS binding method.

[0041] In an example where this method is used, cell membranes on which the G protein-coupled receptor has been
displayed are mixed with 400 pM 35S-labeled GTPγS in a solution containing 20 mM HEPES (pH 7.4), 100 mM NaCl,
10 mM MgCl2, and 50 µM GDP; the mixture is incubated either in the presence or in the absence of a test sample
and then filtered, and the radioactivities of the bound GTPγS are compared.
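The comparison of bound radioactivity described in this paragraph can be sketched as a simple ratio. This is only an illustrative sketch: the function names, the optional nonspecific-binding correction, the example counts, and the 1.5-fold cutoff are assumptions and do not come from the specification.

```python
def gtp_gamma_s_fold_stimulation(cpm_with_sample, cpm_without_sample, cpm_nonspecific=0.0):
    """Ratio of specific GTPgammaS binding with the test sample to binding without it."""
    specific_with = cpm_with_sample - cpm_nonspecific
    specific_without = cpm_without_sample - cpm_nonspecific
    if specific_without <= 0:
        raise ValueError("basal specific binding must be positive")
    return specific_with / specific_without


def is_agonist_candidate(fold, threshold=1.5):
    """Flag a test sample whose fold stimulation reaches a hypothetical cutoff."""
    return fold >= threshold


# Hypothetical filter counts (cpm): with sample, without sample, nonspecific.
fold = gtp_gamma_s_fold_stimulation(4200.0, 2200.0, 200.0)
print(fold, is_agonist_candidate(fold))
```

A ratio above 1 indicates that the test sample increased GTPγS binding relative to the control incubation; where to set the cutoff is an experimental decision.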

[0042] Further, the G protein-coupled receptors share the system of transducing signals into cells via the
activation of the trimer-type GTP-binding protein. The trimer-type GTP-binding proteins are categorized into three
classes depending on the type of intracellular signal transduction system that they activate: namely, the Gq type,
which increases the Ca2+ level; the Gs type, which increases the cAMP level; and the Gi type, which reduces the
cAMP level. Thus, the use of a chimeric protein consisting of the α-subunit from Gq protein and the α-subunit from
another type of G protein, or of the promiscuous Gα proteins Gα15 and Gα16, allows the positive signal in the
ligand screening to result in increased Ca2+ levels in the Gq pathway of intracellular signal transduction. The
increased Ca2+ levels can be detected by using, as indices, altered levels of a reporter gene having a TRE (TPA
responsive element) or MRE (multiple responsive element) upstream, dye indicators such as Fura-2 and Fluo-3, and
the luminescent protein aequorin. Similarly, the use of a chimeric protein consisting of the α-subunit from Gs
protein and the α-subunit from another type of G protein allows the positive signal to result in increased cAMP
levels in the Gs pathway of intracellular signal transduction, and the increased levels can be detected by using,
as indices, altered levels of a reporter gene having a CRE (cAMP-responsive element) upstream (Trends Pharmacol.
Sci. (99) 20: 118).
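The relationship between G protein class, second messenger, and detection method described in this paragraph can be summarized as a lookup. The sketch below is illustrative: the dictionary layout and the function name `expected_readout` are assumptions, and no detection method is listed for the Gi pathway because the text names none.

```python
# Readouts named in the text for each pathway; the layout is illustrative.
PATHWAY_READOUT = {
    "Gq": {"messenger": "Ca2+", "change": "increase",
           "detection": ["TRE or MRE reporter gene", "Fura-2", "Fluo-3", "aequorin"]},
    "Gs": {"messenger": "cAMP", "change": "increase",
           "detection": ["CRE reporter gene"]},
    "Gi": {"messenger": "cAMP", "change": "decrease", "detection": []},
}


def expected_readout(native_class, redirected_to=None):
    """Return the readout for a receptor, optionally rerouted to another pathway
    by a chimeric or promiscuous G alpha subunit."""
    pathway = redirected_to or native_class
    if pathway not in PATHWAY_READOUT:
        raise ValueError(f"unknown G protein class: {pathway}")
    return PATHWAY_READOUT[pathway]


# A Gi-coupled receptor rerouted to the Gq pathway is read out as a Ca2+ increase.
print(expected_readout("Gi", redirected_to="Gq")["messenger"])
```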

[0043] There is no limitation on the type of host cell to be used for the expression of the protein of the present
invention in this screening system, and various types of host cells can be used depending on the purpose. Such
host cells include, for example, COS cells, CHO cells, and HEK293 cells. Vectors for expressing the protein of the
present invention in vertebrate cells preferably comprise a promoter upstream of the gene encoding the protein of
the present invention, an RNA splice site, a polyadenylation site, a transcription termination sequence, a
replication origin, and so forth. For example, pSV2dhfr (Mol. Cell. Biol. (1981) 1, 854-864), pEF-BOS (Nucleic
Acids Res. (1990) 18, 5322), pCDM8 (Nature (1987) 329, 840-842), and pCEP4 (Invitrogen), containing the SV40 early
promoter, are useful vectors for the expression of the G protein-coupled receptor. The DNA of the present
invention can be inserted into a vector according to a standard method by ligation using restriction enzyme sites
(Current protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. Section
11.4-11.11). Further, the vector can be introduced into a host cell by a known method such as, for example, the
calcium-phosphate precipitation method, the electroporation method (Current protocols in Molecular Biology, edit.
Ausubel et al. (1987) Publish. John Wiley & Sons. Section 9.1-9.9), a method with lipofectamine (GIBCO-BRL), a
method with FuGENE6 reagent (Boehringer-Mannheim), or the microinjection method.

[0044] Once the ligands are isolated by the above screening method for ligands binding to the protein of the present
invention, compounds inhibiting the interaction between the protein of the present invention and the ligands can be
screened. Thus, the present invention provides a screening method for compounds having the activity of inhibiting
the binding of the protein of the present invention and the ligand thereof. This screening method comprises the
steps of: (a) contacting the ligand with the protein of the present invention or a partial peptide thereof in the
presence of a test sample, and detecting the binding activity of the protein or the partial peptide thereof with
the ligand; and (b) selecting compounds reducing the binding activity detected in step (a) relative to the binding
activity in the absence of the test sample.
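Steps (a) and (b) amount to comparing ligand binding with and without the test sample. A minimal sketch follows, assuming that binding is measured as bound label counts; the function names, the nonspecific-binding correction, the example counts, and the 50% cutoff are hypothetical and are not specified in the text.

```python
def percent_inhibition(bound_with_sample, bound_without_sample, nonspecific=0.0):
    """Percent reduction of specific ligand binding caused by the test sample."""
    control = bound_without_sample - nonspecific
    if control <= 0:
        raise ValueError("specific binding without the test sample must be positive")
    test = bound_with_sample - nonspecific
    return 100.0 * (1.0 - test / control)


def select_inhibitors(measurements, min_inhibition=50.0):
    """Step (b): keep the test samples that reduce binding by at least the cutoff.

    measurements maps a sample name to (bound_with_sample, bound_without_sample).
    """
    return [name for name, (with_s, without_s) in measurements.items()
            if percent_inhibition(with_s, without_s) >= min_inhibition]


# Hypothetical bound-label counts for two test samples.
data = {"sample A": (300.0, 1000.0), "sample B": (950.0, 1000.0)}
print(select_inhibitors(data))
```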

[0045] The test sample includes, without limitation, a group of compounds obtained by combinatorial chemistry
technology (Tetrahedron (1995) 51, 8135-8137), a group of random peptides prepared by the phage display method
(J. Mol. Biol. (1991) 222, 301-310), and such. Further, culture supernatants of microorganisms and natural
ingredients from plants or marine organisms, and, in addition to these, biological extracts from tissues including
brain, cell extracts, expression products of gene libraries, synthetic low-molecular-weight compounds, synthetic
peptides, natural compounds, and so forth can be screened, but the test sample is not limited thereto.

[0046] The protein of the present invention to be used for the screening can be, for example, the form expressed on 
cell surface, the form in the cell membrane fraction of the cells, or the form immobilized in an affinity column. 
[0047] Specific methods that can be used for the screening include, for example, a method in which the ligand is 
labeled with a radioisotope or the like, and contacted with the protein of the present invention in the presence of a test 
sample, and then, based on the label linked to the ligand, compounds reducing the binding activity of the protein of 
the present invention to the ligand are detected as compared to those detected in the absence of the test sample. 
Further, the screening can also be achieved by using the intracellular alterations as an index by the same method as 
used in the above-mentioned screening to isolate ligands capable of binding to the protein of the present invention. 
Specifically, the screening for a compound inhibiting the binding of the protein of the present invention with the ligand 
can be carried out by contacting cells expressing the protein of the present invention with the ligands in the presence 
of a test sample, and selecting a compound decreasing the degree of alteration in the cells as compared with those 
detected in the absence of the test sample. The cells expressing the protein of the present invention can be prepared 
by the same method as used in the above-described screening of ligands binding to the protein of the present invention. 
The compounds isolated by the screening can be candidates for the agonist or antagonist to the protein of the present
invention.
[0048] Further, the present invention provides a screening method for compounds inhibiting or enhancing the activity
of the protein of the present invention. This screening method comprises the steps of: (a) contacting a ligand of
the protein with cells expressing the protein of the present invention in the presence of a test sample; (b)
detecting an alteration in the cells due to the binding of the ligand to the protein of the present invention; and
(c) selecting compounds suppressing or enhancing the alteration in the cells detected in step (b) as compared with
the alteration in the cells in the absence of the test sample.

[0049] Such test samples to be used, like those to be used in the above-mentioned screening method for ligands
binding to the protein of the present invention, include a group of compounds obtained by combinatorial chemistry
technology, a group of random peptides prepared by using the phage display method, culture supernatants of
microorganisms, natural ingredients from plants or marine organisms, biological tissue extracts, cell extracts,
expression products of gene libraries, synthetic low-molecular-weight compounds, synthetic peptides, natural
compounds, and such. Further, the compounds isolated by the above-mentioned screening of ligands binding to the
protein of the present invention can be used as the test samples. The cells expressing the protein of the present
invention can be prepared by the






EP 1 243 648 A1 



same method as in the above-described screening of ligands binding to the protein of the present invention. The
alteration in the cells after contact with the test sample can be detected by using the alteration of the
intracellular Ca2+ level or cAMP level as an index, as with the above screening method. Further, the intracellular
signal transduction can also be detected by using an assay system such as a reporter assay using luciferase as a
reporter gene.

[0050] When the result of the detection shows that the alteration in the cells contacted with a test sample is
suppressed as compared to that in the cells contacted with the ligand in the absence of the test sample, the test
sample used is determined to be a compound inhibiting the activity of the protein of the present invention.
Conversely, when the test sample enhances the alteration in the cells, the compound is determined to be a compound
enhancing the activity of the protein of the present invention. The term "enhancing or inhibiting the activity of
the protein of the present invention" means that, regardless of whether the interaction with the protein of the
present invention is direct or indirect, the interaction results in the enhancement or inhibition of the activity
of the protein of the present invention. Accordingly, the compounds isolated by the screening include compounds
that act on the protein of the present invention or the ligand and inhibit or enhance the activity of the protein
by inhibiting or enhancing the binding, as well as compounds that neither inhibit nor enhance the binding itself
but still result in the inhibition or enhancement of the activity of the protein of the present invention. Such
compounds include, for example, compounds that neither inhibit nor enhance the binding of the protein of the
present invention and the ligand but inhibit or enhance the pathway of intracellular signal transduction.
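The decision rule in this paragraph, which compares the cellular alteration with and without the test sample, can be sketched as follows. The response values, the function name, and the 10% tolerance band used to call "no effect" are assumptions for illustration; the text does not define a numerical criterion.

```python
def classify_test_sample(response_with_sample, response_ligand_only, tolerance=0.1):
    """Classify a test sample by its effect on the ligand-induced cellular alteration.

    Responses are any positive readout (for example a Ca2+ or cAMP signal);
    tolerance is a hypothetical band around 1.0 treated as "no effect".
    """
    if response_ligand_only <= 0:
        raise ValueError("the ligand-only response must be positive")
    ratio = response_with_sample / response_ligand_only
    if ratio < 1.0 - tolerance:
        return "inhibits"
    if ratio > 1.0 + tolerance:
        return "enhances"
    return "no effect detected"


print(classify_test_sample(40.0, 100.0))
print(classify_test_sample(150.0, 100.0))
```

Note that such a readout does not distinguish compounds acting on the binding itself from compounds acting on the downstream signal transduction pathway, which is the distinction the paragraph draws.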

[0051] When the compounds isolated by the screening method of the present invention are used as pharmaceuticals,
the isolated compound can be administered to patients directly, or can be administered as a pharmaceutical
composition formulated by a known pharmaceutical method. For example, the compound can be formulated, in a form
suitable for oral or parenteral administration, as a pharmaceutical composition obtained by combining the compound
with a pharmaceutically acceptable carrier (for example, excipient, binder, disintegrator, flavor, corrigent,
emulsifier, diluent, solubilizer, etc.), or as a preparation such as a tablet, pill, powder, granule, capsule,
troche, syrup, liquid drug, emulsion, suspension, injection (e.g. liquid drug and suspension), suppository,
inhalant, percutaneous absorbent, eye drop, eye ointment, and so forth. In general, the administration to patients
can be carried out by a method known to those skilled in the art, including intraarterial injection, intravenous
injection, subcutaneous injection, etc. While the doses differ depending on the weight and age of the patient, the
administration method, and such, those skilled in the art can choose proper doses as necessary. Further, when the
compound is encoded by a DNA, the DNA can be inserted into a vector for gene therapy and thus can be used for gene
therapy. The compound isolated by the screening method of the present invention is expected to be applied to the
treatment of, for example, cancers, cirrhosis, and Alzheimer's disease.

[0052] The present invention also provides a method for diagnosing cancers, cirrhosis, or Alzheimer's disease,
comprising the step of detecting the expression of the gene encoding the GPRv protein of the present invention.
[0053] In the Example herein, it has been found that the expression levels of the genes encoding the GPRv proteins
of the present invention in affected tissues associated with cancers, cirrhosis, or Alzheimer's disease are
significantly different from those in normal tissues. Thus, these diseases can be diagnosed by detecting the
expression of the genes encoding the GPRv proteins of the present invention in tissues of subjects. The term "gene
expression" means both transcription and translation.

[0054] The diagnosis method of the present invention can be carried out, for example, as follows. 

[0055] The diagnosis can be achieved by extracting RNA from an aliquot of a tissue collected by biopsy or from a
blood sample according to a standard method, and quantifying GPRv mRNA by quantitative PCR, Northern hybridization,
dot blot hybridization, and such, as described in the Example herein. Alternatively, the diagnosis can also be
achieved by quantifying the GPRv protein in a protein extract from the above tissue by a method such as Western
blotting, immunoprecipitation, ELISA, and such, or by a noninvasive method in which a labeled compound or antibody
binding to the GPRv protein is administered to patients and detected by PET (positron emission tomography) or the
like.

[0056] When the result of the diagnosis shows that the gene expression in the tissues of a subject exhibits a pattern
(for example, an increased or decreased gene expression level as compared to that in the normal tissue) identical to
that of the gene expression in the tissue derived from a patient affected with any one of the above diseases, the
subject is determined to be affected with the disease or to be at risk of developing it.

[0057] For example, the expression of GPRv8 was detectable in the colon, and the expression level was markedly
higher in colon cancers. Accordingly, when the expression of GPRv8 is detected at a high level in the colon tissue
of a subject, the subject is suspected of colon cancer. Alternatively, the expression of GPRv8 was undetectable in
the normal pancreas and uterus, but GPRv8 was expressed at a moderate level after canceration. Accordingly, when
the expression of GPRv8 can be detected in the pancreas or uterus of a subject, the subject is suspected of
pancreatic cancer or uterine cancer.

[0058] The expression of GPRv12 was undetectable in the normal ovary and testis, but was detectable after
canceration. Further, the expression level decreased in the hippocampus with Alzheimer's disease. Accordingly, when
the expression of GPRv12 is detected in the ovary or testis of a subject, the subject is suspected of ovarian
cancer or testicular cancer. Similarly, when the expression of GPRv12 is detected in the hippocampus of a subject
at a lower level than the normal level, the subject is suspected of Alzheimer's disease.

[0059] GPRv16 was expressed in the colon, but was undetectable after canceration. The expression level increased
in the brain after canceration. In the liver, the expression was undetectable after cirrhosis. In the brain of
patients with Alzheimer's disease, the expression level was elevated in the hippocampus. Accordingly, when the
expression of GPRv16 is detected in the colon of a subject at a lower level than the normal level, the subject is
suspected of colon cancer. Further, when the expression is detected in the brain at a higher level than the normal
level, the subject is suspected of brain cancer. Further, when the expression of GPRv16 is detected in the liver at
a lower level than the normal level, the subject is suspected of cirrhosis. Further, when the expression of GPRv16
is detected in the hippocampus at a higher level than the normal level, the subject is suspected of Alzheimer's
disease.

[0060] The expression of GPRv21 was undetectable in the colon and testis after canceration. Accordingly, when the 
expression of GPRv21 is detected in the colon or testis of a subject at a lower level than the normal level, the subject 
is suspected of colon cancer or testicular cancer. 

[0061] The expression level of GPRv40 increased in the brain and testis after canceration, and decreased in the
liver after cirrhosis. Accordingly, when the expression of GPRv40 is detected in the brain or testis at a higher
level than the normal level, the subject is suspected of brain tumor or testicular cancer. Further, when the
expression of GPRv40 is detected in the liver at a lower level than the normal level, the subject is suspected of
cirrhosis.
[0062] The expression level of GPRv47 increased in the brain and kidney and decreased in the testis, after
canceration. The expression was undetectable in the liver after cirrhosis. Accordingly, when the expression of
GPRv47 is detected in the brain or kidney at a higher level than the normal level, the subject is suspected of
brain tumor or kidney cancer. Further, when the expression of GPRv47 is detected in the liver at a lower level than
the normal level, the subject is suspected of cirrhosis.

[0063] The expression level of GPRv51 decreased in the colon and testis after canceration. The expression level
also decreased in the liver after cirrhosis as compared to the normal liver. The expression level increased in the
hippocampus with Alzheimer's disease. Accordingly, when the expression of GPRv51 is detected in the colon or testis
at a lower level than the normal level, the subject is suspected of colon cancer or testicular cancer. Further,
when the expression of GPRv51 is detected in the liver at a lower level than the normal level, the subject is
suspected of cirrhosis. Further, when the expression of GPRv51 is detected in the hippocampus at a higher level
than the normal level, the subject is suspected of Alzheimer's disease.

[0064] The expression level of GPRv71 decreased in the colon and kidney after canceration, and was undetectable in
the liver after cirrhosis. In Alzheimer's disease, the expression level decreased in the frontal lobe. Accordingly,
when the expression of GPRv71 is detected in the colon or kidney at a lower level than the normal level, the
subject is suspected of colon cancer or kidney cancer. Further, when the expression of GPRv71 is detected in the
liver at a lower level than the normal level, the subject is suspected of cirrhosis. Further, when the expression
of GPRv71 is detected in the frontal lobe at a lower level than the normal level, the subject is suspected of
Alzheimer's disease.

[0065] GPRv72 was expressed strongly in the colon, but the expression was undetectable after canceration. The
expression level of GPRv72 increased in the hippocampus with Alzheimer's disease. Accordingly, when the expression
of GPRv72 is detected in the colon at a lower level than the normal level, the subject is suspected of colon cancer.
Further, when the expression of GPRv72 is detected in the hippocampus at a higher level than the normal level, the
subject is suspected of Alzheimer's disease.
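The expression patterns listed in paragraphs [0057] to [0065] can be collected into a lookup table keyed by gene, tissue, and direction of change. The sketch below is an illustrative transcription only: the table layout and the function name `suspected_condition` are assumptions, and expression that is detectable where it is normally undetectable is recorded as "higher" for simplicity.

```python
# (gene, tissue, change relative to normal tissue) -> suspected condition,
# transcribed from paragraphs [0057] to [0065]. "higher" also covers expression
# that is detectable where it is normally undetectable (a simplification).
RULES = {
    ("GPRv8", "colon", "higher"): "colon cancer",
    ("GPRv8", "pancreas", "higher"): "pancreatic cancer",
    ("GPRv8", "uterus", "higher"): "uterine cancer",
    ("GPRv12", "ovary", "higher"): "ovarian cancer",
    ("GPRv12", "testis", "higher"): "testicular cancer",
    ("GPRv12", "hippocampus", "lower"): "Alzheimer's disease",
    ("GPRv16", "colon", "lower"): "colon cancer",
    ("GPRv16", "brain", "higher"): "brain cancer",
    ("GPRv16", "liver", "lower"): "cirrhosis",
    ("GPRv16", "hippocampus", "higher"): "Alzheimer's disease",
    ("GPRv21", "colon", "lower"): "colon cancer",
    ("GPRv21", "testis", "lower"): "testicular cancer",
    ("GPRv40", "brain", "higher"): "brain tumor",
    ("GPRv40", "testis", "higher"): "testicular cancer",
    ("GPRv40", "liver", "lower"): "cirrhosis",
    ("GPRv47", "brain", "higher"): "brain tumor",
    ("GPRv47", "kidney", "higher"): "kidney cancer",
    ("GPRv47", "liver", "lower"): "cirrhosis",
    ("GPRv51", "colon", "lower"): "colon cancer",
    ("GPRv51", "testis", "lower"): "testicular cancer",
    ("GPRv51", "liver", "lower"): "cirrhosis",
    ("GPRv51", "hippocampus", "higher"): "Alzheimer's disease",
    ("GPRv71", "colon", "lower"): "colon cancer",
    ("GPRv71", "kidney", "lower"): "kidney cancer",
    ("GPRv71", "liver", "lower"): "cirrhosis",
    ("GPRv71", "frontal lobe", "lower"): "Alzheimer's disease",
    ("GPRv72", "colon", "lower"): "colon cancer",
    ("GPRv72", "hippocampus", "higher"): "Alzheimer's disease",
}


def suspected_condition(gene, tissue, change):
    """Return the condition suggested by the pattern, or None if the text lists none."""
    return RULES.get((gene, tissue, change))


print(suspected_condition("GPRv8", "colon", "higher"))
```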

[0066] Furthermore, mutations in the genes encoding GPRv proteins of the present invention may result in the onset 
of the above-mentioned diseases. Thus, the diagnosis for the above-mentioned diseases can be carried out by de- 
tecting such mutations in the genes encoding GPRv proteins of the present invention. 
[0067] Such gene diagnosis can be carried out, for example, as follows. 

[0068] As a nucleic acid to be used for the diagnosis, genomic DNA or cDNA may be used directly or after
amplification by PCR or another amplification technique. Deletions and insertions can be detected based on size
differences of the amplification products as compared with that of the normal gene. Point mutations can be
identified based on the differences in the melting temperature of the amplified DNA hybridized with DNA encoding
GPRv. Differences between DNA sequences can be found by detecting alterations in the electrophoretic mobility of
DNA fragments in a denaturant-containing or denaturant-free gel, or by direct sequencing of the DNA.
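Two of the comparisons described in this paragraph, size differences between amplification products and differences found by direct sequencing, can be sketched as follows. The function names, the tolerance parameter, and the example values are assumptions for illustration; melting-temperature and gel-mobility analyses are not modeled.

```python
def classify_length_change(amplicon_bp, normal_bp, tolerance_bp=0):
    """Compare an amplification product with the normal product by size."""
    diff = amplicon_bp - normal_bp
    if diff > tolerance_bp:
        return ("insertion", diff)
    if diff < -tolerance_bp:
        return ("deletion", -diff)
    return ("no size difference", 0)


def point_differences(sample, reference):
    """List (1-based position, reference base, sample base) for equal-length sequences."""
    if len(sample) != len(reference):
        raise ValueError("sequences must have the same length; check for indels first")
    return [(i + 1, r, s)
            for i, (s, r) in enumerate(zip(sample.upper(), reference.upper()))
            if s != r]


# Hypothetical example values.
print(classify_length_change(1119, 1116))
print(point_differences("ATGCCA", "ATGACA"))
```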

[0069] When the diagnosis result shows that the gene encoding the GPRv protein from a subject has mutations as
compared with the wild-type sequence, the subject is suspected of the above disease.
[0070] Namely, a method for diagnosing cancers, cirrhosis, or Alzheimer's disease, or a method for diagnosing the
susceptibility to these diseases, is provided by detecting, according to the method described herein, mutations in
the genes encoding the GPRv proteins, or an increase or decrease in the expression levels of the mRNAs or proteins.

Brief Description of the Drawings
[0071] 

Figure 1 shows a result of BLAST SEARCH with the "GPRv8" amino acid sequence as the query against the entire
sequence data in SWISS-PROT. The sequence showed 36% homology to HUMAN VASOPRESSIN V1B RECEPTOR.

Figure 2 shows a result of BLAST SEARCH with the "GPRv12" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 27% homology to RAT 5-HYDROXYTRYPTAMINE
6 RECEPTOR.

Figure 3 shows a result of BLAST SEARCH with the "GPRv16" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 28% homology to MOUSE GALANIN RECEPTOR
TYPE 1.

Figure 4 shows a result of BLAST SEARCH with the "GPRv21" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 30% homology to BOVIN NEUROPEPTIDE Y
RECEPTOR TYPE 2.

Figure 5 shows a result of BLAST SEARCH with the "GPRv40" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 34% homology to OXYTOCIN RECEPTOR
(P97926).

Figure 6 shows a result of BLAST SEARCH with the "GPRv47" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 43% homology to GPRX_ORYLA PROBABLE G
PROTEIN-COUPLED RECEPTOR (Q91178).

Figure 7 shows a result of BLAST SEARCH with the "GPRv51" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 37% homology to PROBABLE G PROTEIN-COUPLED
RECEPTOR RTA (P23749).

Figure 8 shows a result of BLAST SEARCH with the "GPRv71" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 45% homology to P2Y PURINOCEPTOR 3 (P2Y3)
(Q98907).

Figure 9 shows a result of BLAST SEARCH with the "GPRv72" amino acid sequence as the query against the
entire sequence data in SWISS-PROT. The sequence showed 30% homology to ALPHA-1A ADRENERGIC
RECEPTOR (O02824).

Figure 10 shows a hydropathy plot for GPRv8.

Figure 11 shows an alignment of GPRv8 and similar families. The mark '*' means that the amino acid is completely
conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids at the position
marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK},
{MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked therewith are
conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK},
{NDEQHK}, and {NEQHRK}.

Figure 12 is continued from Figure 11.

Figure 13 shows a hydropathy plot for GPRv12.

Figure 14 shows an amino acid sequence alignment of GPRv12 and AF208288. The mark '*' means that the amino
acid is completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino
acids at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK},
{NHQK}, {NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position
marked therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA},
{SGND}, {SNDEQK}, {NDEQHK}, and {NEQHRK}.

Figure 15 shows a hydropathy plot for GPRv16.

Figure 16 shows a summary of HMMPFAM, transmembrane domain, and S-S bond of GPRv16. The mark "***"
indicates a region assigned as 7tm_1 based on the result of HMMPFAM. The mark "###" represents transmembrane
domain. The mark "@" indicates Cys capable of forming S-S bond.

Figure 17 shows a hydropathy plot for GPRv21.

Figure 18 shows an amino acid sequence alignment of GPRv21 and the related proteins. The mark '*' means that
the amino acid is completely conserved in all the sequences at the position marked therewith. The mark ':' means
that amino acids at the position marked therewith are conserved within any one of the following groups: {STA},
{NEQK}, {NHQK}, {NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the
position marked therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK},
{STPA}, {SGND}, {SNDEQK}, {NDEQHK}, and {NEQHRK}.

Figure 19 is continued from Figure 18.

Figure 20 shows a hydropathy plot for GPRv40.

Figure 21 shows a summary of HMMPFAM, transmembrane domain, and S-S bond of GPRv40. The mark "***"
indicates a region assigned as 7tm_1 based on the result of HMMPFAM. The mark "###" indicates transmembrane
domain. The mark "@" indicates Cys capable of forming S-S bond.

Figure 22 shows a hydropathy plot for GPRv47.

Figure 23 shows an alignment of GPRv47 and the related proteins. The mark '*' means that the amino acid is
completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids
at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK},
{NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked
therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND},
{SNDEQK}, {NDEQHK}, and {NEQHRK}.

Figure 24 is continued from Figure 23.

Figure 25 is continued from Figure 24.

Figure 26 shows a hydropathy plot for GPRv51.

Figure 27 shows an alignment of GPRv51 and the related proteins. The mark '*' means that the amino acid is
completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids
at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK},
{NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked
therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND},
{SNDEQK}, {NDEQHK}, and {NEQHRK}.

Figure 28 shows a hydropathy plot for GPRv71.

Figure 29 shows an alignment of GPRv71 and related proteins. The mark '*' means that the amino acid is completely
conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids at the position
marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK},
{MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked therewith are
conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK},
{NDEQHK}, and {NEQHRK}.

Figure 30 is continued from Figure 29.

Figure 31 shows a hydropathy plot for GPRv72.

Figure 32 shows an alignment of GPRv72 and related proteins. The mark '*' means that the amino acid is completely
conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids at the position
marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK},
{MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked therewith are
conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK},
{NDEQHK}, and {NEQHRK}.

Figure 33 is continued from Figure 32.

Figure 34 is continued from Figure 33.


Best Mode for Carrying out the Invention

[0072] The present invention is specifically illustrated below with reference to Examples, but it is not to be construed
as being limited thereto. Unless otherwise stated, the procedures can be carried out by known methods (Maniatis, T.
et al. (1982): "Molecular Cloning - A Laboratory Manual", Cold Spring Harbor Laboratory, NY).

[Example 1] Isolation of the genes encoding the novel G protein-coupled receptors 

[0073] The full-length cDNAs encoding the novel G protein-coupled receptors of the present invention (GPRv8,
GPRv12, GPRv16, GPRv21, GPRv40, GPRv47, GPRv51, GPRv71, and GPRv72) were obtained by PCR.
[0074] The amplification of the novel G protein-coupled receptor GPRv8 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetus as a template, and forward primer:
5'-ATGCCAGCCAACTTCACAGAGGGCAGCT-3' (SEQ ID NO: 9) and reverse primer:
5'-CTAGATGAATTCTGGCTTGGACAGAATC-3' (SEQ ID NO: 10). PCR was carried out with Pyrobest DNA polymerase
(Takara); the thermal cycling profile consisted of preheat at 94°C (2.5 minutes) and 25 cycles of 94°C
(30 seconds)/60°C (30 seconds)/72°C (1 minute). The amplification resulted in about 1.1-kbp DNA fragments. The
fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was
determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined
sequence is shown in SEQ ID NO: 5.
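As a quick consistency check on the primer pair given for GPRv8, one can confirm that the forward primer begins with a start codon and that the reverse primer, read on the coding strand, ends with a stop codon, which is what amplification of a full open reading frame would require. This is only an illustrative sketch; the helper names are hypothetical and the check is not part of the described procedure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def spans_full_orf(forward_primer, reverse_primer):
    """True if the forward primer starts with ATG and the reverse primer
    corresponds to a coding-strand sequence ending in a stop codon."""
    starts_with_atg = forward_primer.upper().startswith("ATG")
    ends_with_stop = reverse_complement(reverse_primer)[-3:] in STOP_CODONS
    return starts_with_atg and ends_with_stop


# GPRv8 primers as given in the text (SEQ ID NO: 9 and SEQ ID NO: 10).
forward = "ATGCCAGCCAACTTCACAGAGGGCAGCT"
reverse = "CTAGATGAATTCTGGCTTGGACAGAATC"
print(spans_full_orf(forward, reverse), reverse_complement(reverse)[-3:])
```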

[0075] The sequence comprises an open reading frame of 1116 nucleotides (from the first nucleotide to the 1116th
nucleotide in SEQ ID NO: 5). An amino acid sequence deduced from the open reading frame (371 amino acids) is
shown in SEQ ID NO: 1. Since the deduced amino acid sequence contains hydrophobic regions corresponding to
seven transmembrane domains characteristic of G protein-coupled receptor, the gene is found to encode a G protein-
coupled receptor.
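The stated lengths are consistent with each other if the open reading frame is counted with its stop codon: 1116 nucleotides correspond to 372 codons, that is, 371 amino acids plus one stop codon. A small sketch of this arithmetic follows; the function name is hypothetical, and counting the stop codon within the open reading frame is an assumption inferred from the numbers.

```python
def orf_protein_length(orf_nucleotides, includes_stop_codon=True):
    """Number of amino acids encoded by an open reading frame of the given length."""
    if orf_nucleotides % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 nucleotides")
    codons = orf_nucleotides // 3
    return codons - 1 if includes_stop_codon else codons


# 1116 nucleotides -> 372 codons -> 371 amino acids plus a stop codon.
print(orf_protein_length(1116))
```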

[0076] The amplification of the novel G protein-coupled receptor GPRv12 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetal brain as a template, and forward primer:
5'-ATGGGCCCCGGCGAGGCGCTGCTGGCGG-3' (SEQ ID NO: 11) and reverse primer:
5'-TCAGTGTGTCTGCTGCAGGCAGGAATCA-3' (SEQ ID NO: 12). PCR was carried out with Pyrobest DNA polymerase
(Takara) in the presence of 5% formamide; the thermal cycling profile consisted of preheat at 94°C (2.5 minutes),
5 cycles of 94°C (5 seconds)/72°C (4 minutes), 5 cycles of 94°C (5 seconds)/70°C (4 minutes), and 25 cycles of
94°C (5 seconds)/68°C (4 minutes). The amplification resulted in about 1.1-kbp DNA fragments. The fragments were
cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was determined by the
dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined sequence is shown in
SEQ ID NO: 6.

[0077] The sequence comprises an open reading frame of 1092 nucleotides (from the first nucleotide to the 1092nd
nucleotide in SEQ ID NO: 6). An amino acid sequence deduced from the open reading frame (363 amino acids) is
shown in SEQ ID NO: 2. Since the deduced amino acid sequence contains hydrophobic regions corresponding to
seven transmembrane domains characteristic of G protein-coupled receptors, the gene is found to encode a
G protein-coupled receptor.

[0078] The amplification of the novel G protein-coupled receptor GPRv16 was carried out using a Marathon Ready
cDNA (Clontech) derived from human brain as a template, and forward primer: 5'-ATGCTGGCAGCTGCCTTTGCAGACTCTAAC-3'
(SEQ ID NO: 13) and reverse primer: 5'-CTATTTAACACCTTCCCCTGTCTCTTGATC-3' (SEQ ID NO: 14).
PCR was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at
94°C (2 minutes) and 30 cycles of 94°C (30 seconds)/60°C (30 seconds)/72°C (1 minute). The amplification resulted
in about 1.2-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide
sequence of the resultant clone was determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied
Biosystems). The determined sequence is shown in SEQ ID NO: 7.

[0079] The sequence comprises an open reading frame of 1260 nucleotides (from the first nucleotide to the 1260th
nucleotide in SEQ ID NO: 7). An amino acid sequence deduced from the open reading frame (419 amino acids) is
shown in SEQ ID NO: 3. Since the deduced amino acid sequence contains hydrophobic regions corresponding to
seven transmembrane domains characteristic of G protein-coupled receptors, the gene is found to encode a
G protein-coupled receptor.

[0080] The amplification of the novel G protein-coupled receptor GPRv21 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGGAGACCACCATGGGGTTCATGGATG-3'
(SEQ ID NO: 15) and reverse primer: 5'-TTATTTTAGTCTGATGCAGTCCACCTCTTC-3' (SEQ ID NO: 16).
PCR was carried out with Pyrobest DNA polymerase (Takara) in the presence of 5% formamide; the thermal cycling
profile consisted of preheat at 94°C (2.5 minutes), 5 cycles of 94°C (5 seconds)/72°C (4 minutes), 5 cycles of 94°C (5
seconds)/70°C (4 minutes), and 25 cycles of 94°C (5 seconds)/68°C (4 minutes). The amplification resulted in about
1.2-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the
resultant clone was determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems).
The determined sequence is shown in SEQ ID NO: 8.

[0081] The sequence comprises an open reading frame of 1182 nucleotides. An amino acid sequence deduced from
the open reading frame (333 amino acids) is shown in SEQ ID NO: 4. Since the deduced amino acid sequence contains
hydrophobic regions corresponding to seven transmembrane domains characteristic of G protein-coupled receptors,
the gene is found to encode a G protein-coupled receptor.

[0082] The amplification of the novel G protein-coupled receptor GPRv40 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGGAGGATCTCTTTAGCCCCTCAATTC-3'
(SEQ ID NO: 27) and reverse primer: 5'-CTAGAAGGCACTTTCGCAGGAGCAAGGC-3' (SEQ ID NO: 28).
PCR was carried out with Pyrobest DNA polymerase (Takara) in the presence of 5% formamide; the thermal cycling
profile consisted of preheat at 98°C (2.5 minutes), 5 cycles of 98°C (5 seconds)/72°C (4 minutes), 5 cycles of 98°C (5
seconds)/70°C (4 minutes), and 25 cycles of 98°C (5 seconds)/68°C (4 minutes). The amplification resulted in about
1.3-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the
resultant clone was determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems).
The determined sequence is shown in SEQ ID NO: 22.

[0083] The sequence comprises an open reading frame of 1305 nucleotides (SEQ ID NO: 22). An amino acid
sequence deduced from the open reading frame (434 amino acids) is shown in SEQ ID NO: 17. Since the deduced amino
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of
G protein-coupled receptors, the gene is found to encode a G protein-coupled receptor.

[0084] The amplification of the novel G protein-coupled receptor GPRv47 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetal brain as a template, and forward primer: 5'-ATGGAGTCCTCACCCATCCCCCAGTCATC-3'
(SEQ ID NO: 29) and reverse primer: 5'-TCATGACTCCAGCCGGGGTGAGGCGGCAG-3' (SEQ ID NO: 30).
PCR was carried out with Pyrobest DNA polymerase (Takara) in the presence of 5% formamide; the
thermal cycling profile consisted of preheat at 94°C (2 minutes) and 35 cycles of 94°C (30 seconds)/50°C (30 seconds)
/72°C (1.5 minutes). The amplification resulted in about 1.4-kbp DNA fragments. The fragments were cloned into
pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was determined by the dideoxy terminator
method in an ABI377 DNA Sequencer (Applied Biosystems). The determined sequence is shown in SEQ ID NO: 23.

[0085] The sequence comprises an open reading frame of 1356 nucleotides (SEQ ID NO: 23). An amino acid
sequence deduced from the open reading frame (451 amino acids) is shown in SEQ ID NO: 18. Since the deduced amino
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of
G protein-coupled receptors, the gene is found to encode a G protein-coupled receptor.

[0086] The amplification of the novel G protein-coupled receptor GPRv51 was carried out using a Marathon Ready
cDNA (Clontech) derived from human testis as a template, and forward primer: 5'-ATGAACCAGACTTTGAATAGCAGTGG-3'
(SEQ ID NO: 31) and reverse primer: 5'-TCAAGCCCCCATCTCATTGGTGCCCACG-3' (SEQ ID NO: 32). PCR
was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 98°C (2.5
minutes) and 35 cycles of 98°C (30 seconds)/50°C (30 seconds)/68°C (4 minutes). The amplification resulted in about
1.0-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the
resultant clone was determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems).
The determined sequence is shown in SEQ ID NO: 24.

[0087] The sequence comprises an open reading frame of 966 nucleotides (SEQ ID NO: 24). An amino acid sequence
deduced from the open reading frame (321 amino acids) is shown in SEQ ID NO: 19. Since the deduced amino acid
sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of
G protein-coupled receptors, the gene is found to encode a G protein-coupled receptor.

[0088] The amplification of the novel G protein-coupled receptor GPRv71 was carried out using a Marathon Ready
cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGGAGAAGGTGGACATGAATACATCAC-3'
(SEQ ID NO: 33) and reverse primer: 5'-TTACCCAGATCTGTTCAACCCTGGGCATC-3' (SEQ ID NO: 34).
PCR was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 94°C
(2.5 minutes), 5 cycles of 98°C (5 seconds)/72°C (4 minutes), 5 cycles of 98°C (5 seconds)/70°C (4 minutes), and 25
cycles of 98°C (5 seconds)/68°C (4 minutes). The amplification resulted in about 1.0-kbp DNA fragments. The
fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was determined
by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined sequence is shown
in SEQ ID NO: 25.

[0089] The sequence comprises an open reading frame of 1002 nucleotides (SEQ ID NO: 25). An amino acid
sequence deduced from the open reading frame (333 amino acids) is shown in SEQ ID NO: 20. Since the deduced amino
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of
G protein-coupled receptors, the gene is found to encode a G protein-coupled receptor.

[0090] The amplification of the novel G protein-coupled receptor GPRv72 was carried out using human genomic DNA
(Clontech) as a template, and forward primer: 5'-ATGACGTCCACCTGCACCAACAGCACGC-3' (SEQ ID NO: 35) and
reverse primer: 5'-TCAAGGAAAAGTAGCAGAATCGTAGGAAG-3' (SEQ ID NO: 36). PCR was carried out with
Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 94°C (2 minutes) and 30 cycles
of 94°C (30 seconds)/55°C (30 seconds)/68°C (4 minutes). The amplification resulted in about 1.5-kbp DNA fragments.
The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was
determined by the dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined
sequence is shown in SEQ ID NO: 26.

[0091] The sequence comprises an open reading frame of 1527 nucleotides (SEQ ID NO: 26). An amino acid
sequence deduced from the open reading frame (508 amino acids) is shown in SEQ ID NO: 21. Since the deduced amino
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of
G protein-coupled receptors, the gene is found to encode a G protein-coupled receptor.

[Example 2] BLAST search of the amino acid sequences of the novel G protein-coupled receptors against SWISS-PROT

[0092] The result of the BLAST search of the amino acid sequence of "GPRv8" against SWISS-PROT is shown in
Figure 1. Among known G protein-coupled receptors, "GPRv8" exhibited the highest homology (36%) to HUMAN
VASOPRESSIN V1B RECEPTOR (P47901, 424 aa). Thus, "GPRv8" was concluded to be a novel G protein-coupled receptor.

[0093] The result of the BLAST search of the amino acid sequence of "GPRv12" against SWISS-PROT is shown in
Figure 2. Among known G protein-coupled receptors, "GPRv12" exhibited the highest homology (27%) to RAT
5-HYDROXYTRYPTAMINE 6 RECEPTOR (P31388, 436 aa). Thus, "GPRv12" was concluded to be a novel G protein-coupled
receptor.

[0094] The result of the BLAST search of the amino acid sequence of "GPRv16" against SWISS-PROT is shown in
Figure 3. Among known G protein-coupled receptors, "GPRv16" exhibited the highest homology (28%) to MOUSE
GALANIN RECEPTOR TYPE 1 (P56479, 348 aa). Thus, "GPRv16" was concluded to be a novel G protein-coupled receptor.

[0095] The result of the BLAST search of the amino acid sequence of "GPRv21" against SWISS-PROT is shown in
Figure 4. Among known G protein-coupled receptors, "GPRv21" exhibited the highest homology (30%) to BOVIN
NEUROPEPTIDE Y RECEPTOR TYPE 2 (P79113, 384 aa). Thus, "GPRv21" was concluded to be a novel G protein-coupled
receptor.

[0096] The result of the BLAST search of the amino acid sequence of "GPRv40" against SWISS-PROT is shown in
Figure 5. "GPRv40" was not identical to any of the known G protein-coupled receptors, but exhibited the highest homology
(34%) to OXYTOCIN RECEPTOR (P97926, 388 aa). Thus, "GPRv40" was concluded to be a novel G protein-coupled
receptor.

[0097] The result of the BLAST search of the amino acid sequence of "GPRv47" against SWISS-PROT is shown in
Figure 6. "GPRv47" was not identical to any of the known G protein-coupled receptors, but exhibited the highest homology
(43%) to GPRX_ORYLA PROBABLE G PROTEIN-COUPLED RECEPTOR (Q91178, 428 aa). Thus, "GPRv47" was
concluded to be a novel G protein-coupled receptor.

[0098] The result of the BLAST search of the amino acid sequence of "GPRv51" against SWISS-PROT is shown in
Figure 7. "GPRv51" was not identical to any of the known G protein-coupled receptors, but exhibited the highest homology
(37%) to PROBABLE G PROTEIN-COUPLED RECEPTOR RTA (P23749, 343 aa). Thus, "GPRv51" was concluded
to be a novel G protein-coupled receptor.

[0099] The result of the BLAST search of the amino acid sequence of "GPRv71" against SWISS-PROT is shown in
Figure 8. "GPRv71" was not identical to any of the known G protein-coupled receptors, but exhibited the highest homology
(45%) to Chicken P2Y PURINOCEPTOR 3 (P2Y3) (Q98907, 328 aa). Thus, "GPRv71" was concluded to be a novel
G protein-coupled receptor.

[0100] The result of the BLAST search of the amino acid sequence of "GPRv72" against SWISS-PROT is shown in
Figure 9. "GPRv72" was not identical to any of the known G protein-coupled receptors, but exhibited the highest homology
(30%) to ALPHA-1A ADRENERGIC RECEPTOR (O02824, 466 aa). Thus, "GPRv72" was concluded to be a novel
G protein-coupled receptor.

[Example 3] Analysis of tissue-specific expression

1. Reagents

1.1. Primers for quantitative polymerase chain reaction (PCR) and TaqMan probes:

[0101] Sense primers, antisense primers, and TaqMan probes were designed by using the genetic analysis software
"Primer Express version 1.0" from PE Biosystems. Custom-made primers and TaqMan probes were purchased
from Amersham Pharmacia Biotech (Tokyo) and PE Biosystems Japan, respectively. The TaqMan probes were
labeled with the reporter dye FAM at the 5' end and with the quencher TAMRA at the 3' end. The nucleotide sequences
of the primers and TaqMan probes are shown below.

Synthetic DNA for GPRv8

[0102]

PCR primer G8.957F: CCAGGAGCGTTTCTATGCCT (SEQ ID NO: 37)
PCR primer G8.1082R: TGTGATCTTTGCTCCCTGCA (SEQ ID NO: 38)
TaqMan probe GPRv8.987T: TCAGAACCTGCCAGCATTGAATAGTGCC (SEQ ID NO: 39)

Synthetic DNA for GPRv12

[0103]

PCR primer G12.794F: ATCTGCTTTGCCCCGTATGT (SEQ ID NO: 40)
PCR primer G12.903R: ACCGCCTTGCTGTAGGTCAG (SEQ ID NO: 41)
TaqMan probe GPRv12.834T: TCGTGCCCTTCGTCACCGTGAA (SEQ ID NO: 42)

Synthetic DNA for GPRv16

[0104]

PCR primer G16.1133F: CCCAGCATCCATACCAGAAAA (SEQ ID NO: 43)
PCR primer G16.1254R: CTGTGTCCCTCTCATGCCAAA (SEQ ID NO: 44)
TaqMan probe GPRv16.1193T: TGAGAAGGCAGAGATTCCCATCCTTCCT (SEQ ID NO: 45)

Synthetic DNA for GPRv21

[0105]

PCR primer G21.989F: TCGCCATGAGCAACAGCAT (SEQ ID NO: 46)
PCR primer G21.1114R: CACTGGACTTACCGCCATTGT (SEQ ID NO: 47)
TaqMan probe GPRv21.1064T: AGATCATGTTGCTCCACTGGAAGGCTTCT (SEQ ID NO: 48)

Synthetic DNA for GPRv40

[0106]

PCR primer G40.16F: GGATCTCTTTAGCCCCTCAATTC (SEQ ID NO: 49)
PCR primer G40.99R: AAGGTCAGGTTGAGACCCCAG (SEQ ID NO: 50)
TaqMan probe GPRv40.53T: AACATTTCCGTGCCCATCTTGCTGG (SEQ ID NO: 51)

Synthetic DNA for GPRv47

[0107]

PCR primer G47.1292F: GCTGTTGACTTTCGAATCCCA (SEQ ID NO: 52)
PCR primer G47.1393R: ACGGAGGTAGCTGTCTGACATGA (SEQ ID NO: 53)
TaqMan probe GPRv47.1336T: TGAGTTCCTGGAGCAGCAACTCACCA (SEQ ID NO: 54)

Synthetic DNA for GPRv51

[0108]

PCR primer G51.190F: GGCTTTCGAATGCACAGGAA (SEQ ID NO: 55)
PCR primer G51.276R: GGAAGCCATGCTGAAGAGGA (SEQ ID NO: 56)
TaqMan probe GPRv51.214T: TTCTGCATCTATATCCTCAACCTGGCGG (SEQ ID NO: 57)

Synthetic DNA for GPRv71

[0109]

PCR primer G71.746F: TGGCCTCTTCACCCTCTGTTT (SEQ ID NO: 58)
PCR primer G71.841R: ATCAAGAGCTGGCAGTCCTGA (SEQ ID NO: 59)
TaqMan probe GPRv71.775T: TCCATATCACTCGCTCCTTCTACCTCACCA (SEQ ID NO: 60)

Synthetic DNA for GPRv72

[0110]

PCR primer G72.101F: CCAAAATGCCCATCAGCCT (SEQ ID NO: 61)
PCR primer G72.190R: GCACTATGTTGCCGACGAAA (SEQ ID NO: 62)
TaqMan probe GPRv72.132T: CATCCGCTCAACCGTGCTGGTTATCT (SEQ ID NO: 63)

1.2. cDNA derived from patients

[0111] Matched cDNA Pairs from Clontech were used as cDNAs derived from tumor and normal tissues of a single
patient. The tissues were lung, stomach, colon, ovary, prostate, uterus, and kidney.

[0112] cDNAs derived from the following tissues were purchased from BioChain Institute: brain, pancreas, and
testis from patients with tumors and from normal adults; liver from cirrhosis patients and normal adults; kidney from
lupus patients; and the hippocampus and frontal lobe from Alzheimer's disease (AD) patients and normal adults.

1.3. Reagents for quantitative PCR:

[0113] TaqMan Universal PCR Master Mix (PE Biosystems) was used in this assay. TaqMan β-actin Control Reagents
(PE Biosystems) were used for measuring the internal standard.

2. Quantitative PCR:

1) Dilution of template cDNA

[0114] The cDNAs from BioChain were diluted 50-fold with water, and the cDNAs from Clontech were diluted 5-fold
with water, for use.


2) Preparation of Master Mix

[0115] A reaction solution with the following composition was prepared.

                           Reaction volume   Preparation volume
2x Master Mix              12.5 µl           1380 µl
Sense primer (50 µM)       0.5 µl            55.2 µl
Antisense primer (50 µM)   0.5 µl            55.2 µl
TaqMan Probe (5 µM)        1 µl              110.4 µl
Template cDNA              2.5 µl            -
Purified water             8 µl              883.2 µl
Total volume               25 µl             2484 µl

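
The preparation volumes in the table are the per-reaction volumes multiplied by a common factor, with the template cDNA left out of the bulk mix. The sketch below reproduces this scaling. The factor of 110.4 reaction-equivalents is inferred from the table values (1380 µl / 12.5 µl) and is not stated in the text; the names `PER_REACTION_UL` and `scale_master_mix` are hypothetical.

```python
# Per-reaction volumes (µl) of the template-free components, taken from the table.
PER_REACTION_UL = {
    "2x Master Mix": 12.5,
    "Sense primer (50 uM)": 0.5,
    "Antisense primer (50 uM)": 0.5,
    "TaqMan probe (5 uM)": 1.0,
    "Purified water": 8.0,
}


def scale_master_mix(n_reactions: float) -> dict:
    """Scale the template-free components to a bulk mix for n_reactions."""
    return {name: round(vol * n_reactions, 1) for name, vol in PER_REACTION_UL.items()}


if __name__ == "__main__":
    batch = scale_master_mix(110.4)  # assumed factor, inferred from the table
    for name, vol in batch.items():
        print(f"{name}: {vol} µl")
    print("Total:", round(sum(batch.values()), 1), "µl")
```

Each reaction then combines 22.5 µl of this mix with 2.5 µl of template, which is the same 9:1 ratio as the 54 µl of Master Mix and 6 µl of template cDNA used in the next step.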

3) Preparation of PCR solution

[0116] 6 µl of template cDNA solution was added to 54 µl of Master Mix solution. Then, 25-µl aliquots of the mixture were
added in duplicate to the sample wells of a PCR plate to be placed in a device for quantitative PCR. A 25-µl aliquot of
the above-mentioned Master Mix was added to each of two wells for non-template control. The standard curve was
produced using eight 10-fold serial dilutions of cDNA which had been subcloned into pCEP4 vector, where the dilution
started from 100 pg/µl. A 25-µl aliquot of each mixture obtained by combining 54 µl of Master Mix prepared in Section
2) and 6 µl of each standard solution prepared above was added into a standard well. Namely, the largest amount of
the plasmid DNA was 250 pg and the smallest was 25 ag (a: atto, 10^-18) in the standard wells. After 8-cap strips were
placed on the top of the wells, the bubbles were removed by light centrifugation.
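
The stated range of the standard curve (250 pg down to 25 ag per well) follows from the dilution scheme. The following is a minimal sketch of that arithmetic, assuming the first standard is the undiluted 100 pg/µl solution; the function name `standard_amounts_pg` and its parameters are hypothetical.

```python
def standard_amounts_pg(stock_pg_per_ul=100.0, n_points=8,
                        template_ul=6.0, mix_ul=54.0, aliquot_ul=25.0):
    """Plasmid DNA (pg) per standard well for a 10-fold serial dilution series."""
    amounts = []
    for i in range(n_points):
        conc = stock_pg_per_ul / (10 ** i)       # pg/µl of the i-th dilution
        total_pg = conc * template_ul            # pg added to the Master Mix
        fraction = aliquot_ul / (template_ul + mix_ul)
        amounts.append(total_pg * fraction)      # pg carried into one 25-µl well
    return amounts


if __name__ == "__main__":
    for i, pg in enumerate(standard_amounts_pg()):
        print(f"standard {i + 1}: {pg:.3g} pg")
```

The first well receives 100 pg/µl x 6 µl x 25/60 = 250 pg, and the eighth receives 10^-7 of that amount, that is, 2.5 x 10^-5 pg or 25 ag.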

4) PCR

[0117] The plate was placed in the device for quantitative PCR (GeneAmp 5700 Sequence Detection System; PE
Biosystems), and the reaction was then carried out according to the following cycling program.

(1) 50°C, 2 minutes: 1 cycle
(2) 95°C, 10 minutes: 1 cycle
(3) 95°C, 15 seconds and 60°C, 1 minute: 50 cycles


5) Quantitative analysis 



[0118] The quantification was carried out according to the operation manual of the GeneAmp 5700, and the results
were output.

3. Results and conclusions: 



[0119] The GPCR expression profiles obtained with the cDNAs from the organs of normal humans and those of
patients with disease were represented as ratios relative to the expression level of the actin gene as an internal
standard. The experiment was carried out in duplicate, and the average values are shown in Table 1.



Table 1

Relative copy number

GPRv8 GPRv12 GPRv16 GPRv21 GPRv40 GPRv47 GPRv51 GPRv71 GPRv72



^^BrBi^Nprmal 1 !-.-:, " ?1ffiS&i&3R'i -^VC. ..... . 6 - ^ \ ' * 



A 0 

Lung Normal u %* ■ w ■ ■ ■ 1 O 

Tumor I 0 J .0 .11 2 J. .1. J. 

- r "sto^Sh:Nor^^ . ft. ' i • • • ■ -is • o 

25 • ■ ^^t^w >; ^'#^^: v_ v^-^ii^z^^ ., fr&v'; ---i* -JL: J. 

PwwVUeinml*" ~ 0* 0 O 0 4 0 0 0 0 
Tumor* 45 _ 0. 0 .23 Jt 3 4 J. 

OvBry'Normi" 0 0 1 0 2 1 2 1 1 
, n Tumor 0 4 0 0 21 1 3 3 p. 

^ -Tumor " * - ^ife^^^ 
ProSto'Norm^T 0 0 6 0 18 1 3 1 0 
Tumor 6 O 0 _ 0 9 . . 0 .8 3 0. 

35 ^^-^^Vo«»*^"ir--^ir--- t -'-6- tit — -is '6 w a r 

. Tumor 9 O 0 0 28 10 IS 0 0 

Lupus* 25 0 1 0 1 0 310 

"Tir P Vocimpui No7™l' 0 6~ 12 4 0 40 113 2 5 . 2 
40 AD • 16 1 50 3 111 63 55 .12 27_ 

^"FroTSfTbK^^^ , 3^^"f ""'l 



[0120] When a 3-fold or greater alteration in the expression level was reproducible, the difference was assessed as
being significant. The cDNAs derived from the organs marked with 1) were purchased from BioChain, and the cDNAs
derived from the organs without the mark were purchased from Clontech. The disease-dependent differences in the
expression levels of the respective genes are summarized below.
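
The 3-fold criterion can be expressed as a small helper. This is only a sketch: it does not model the reproducibility requirement, and the treatment of undetectable (zero) values is an assumption made here, not something the text specifies. The function name `is_significant` and the example values are hypothetical.

```python
def is_significant(normal: float, disease: float, fold: float = 3.0) -> bool:
    """True when two relative copy numbers differ by at least `fold` in either direction."""
    if normal < 0 or disease < 0:
        raise ValueError("relative copy numbers must be non-negative")
    if normal == 0 and disease == 0:
        return False  # undetectable in both samples: no change
    if normal == 0 or disease == 0:
        # Assumption: a change between undetectable and detectable counts as significant.
        return True
    return max(normal, disease) / min(normal, disease) >= fold


if __name__ == "__main__":
    # Hypothetical relative copy numbers (normalized to actin), for illustration only.
    print(is_significant(10, 30))  # 3-fold increase
    print(is_significant(10, 20))  # 2-fold increase
    print(is_significant(30, 10))  # 3-fold decrease
```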

[0121] The expression of GPRv8 was undetectable in the normal pancreas and uterus, but GPRv8 was expressed
at a moderate level after canceration. GPRv8 was strongly expressed in the colon, and was more strongly expressed
in colon cancer.

[0122] The expression level of GPRv12 was generally low. The expression was undetectable in the normal ovary 
and testis, but was found after canceration. The expression level decreased in the hippocampus with Alzheimer's 
disease. 

[0123] GPRv16 was expressed in the colon, but was undetectable after canceration. The expression level increased
in the brain after canceration. In the liver, the expression was undetectable after cirrhosis. In the brain of Alzheimer's
disease patients, the expression level was elevated in the hippocampus.

[0124] The expression level of GPRv21 was low, and was undetectable in the colon and testis after canceration.

[0125] The expression level of GPRv40 increased in the brain and testis after canceration, and decreased in the
liver after cirrhosis.

[0126] The expression level of GPRv47 increased in the brain and kidney and decreased in the testis after
canceration. The expression was undetectable in the liver after cirrhosis.

[0127] GPRv51 was strongly expressed in the colon, but the expression level decreased after canceration. The 
expression level decreased in the testis after canceration. The expression level also decreased in the liver after cirrhosis
as compared to the normal liver. The expression level was low in the brain, but increased in the hippocampus with 
Alzheimer's disease. 

[0128] The expression level of GPRv71 decreased in the colon and kidney after canceration, and the expression 
thereof was undetectable in the liver after cirrhosis. In the patient with Alzheimer's disease, the expression level de- 
creased in the frontal lobe. 

[0129] GPRv72 was expressed strongly in the colon, but the expression thereof was undetectable after canceration. 
The expression level was low in the brain, but increased in the hippocampus with Alzheimer's disease. 

[Example 4] Analysis of GPRv8 with bioinformatics

1. Homology search of GPRv8

[0130] The amino acid sequence of GPRv8 was analyzed by searching known sequences (the known sequence
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0,
http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0)
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402). The result showed that GPRv8 had homology
to the sequences shown in Table 2. Thus, GPRv8 was revealed to be a novel clone having homology to GPCR. The
hits with an E-value lower than e-39 are shown in Table 2.

Table 2

Hit (ID)    E-value   Identities %   Description
AE003754    2e-68     43             gene: "CGenr-Drosophila melanogaster
AF147743    7e-43     33             vasotocin VT1 receptor - Gallus gallus
AF184966    2e-42     33             arginine vasotocin receptor - Platichthys flesus
X93313      4e-42     36             mesotocin receptor - giant toad
X76321      8e-42     32             vasotocin receptor - white sucker
X87783      4e-41     33             isotocin receptor - white sucker
X64878      3e-40     32             oxytocin receptor - H. sapiens
U82440      7e-40     32             oxytocin receptor - Macaca mulatta


2. Prediction of transmembrane domain

[0131] The amino acid sequence of GPRv8 was analyzed according to the method of Kyte-Doolittle (J. Kyte and R.
F. Doolittle, (1982), J. Mol. Biol., 157, 105-132) to obtain a hydropathy plot, which was used to predict the transmembrane
domains. The result showed that GPRv8 had seven transmembrane domains (TM1-TM7) (Figure 10).
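
A Kyte-Doolittle hydropathy profile is a sliding-window average of per-residue hydropathy values. The sketch below illustrates the idea; the window of 19 residues and the threshold of 1.6 are values commonly used with this scale for membrane-spanning segments and are assumptions here, since the text does not state the parameters actually used. The function names and the toy sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq: str, window: int = 19) -> list:
    """Mean hydropathy of every full window along the sequence."""
    scores = [KD[aa] for aa in seq.upper()]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


def candidate_segments(profile: list, threshold: float = 1.6) -> list:
    """Runs of window start indices (0-based, end exclusive) with mean >= threshold."""
    segments, start = [], None
    for i, value in enumerate(profile):
        if value >= threshold and start is None:
            start = i
        elif value < threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(profile)))
    return segments


if __name__ == "__main__":
    # Toy sequence: one hydrophobic stretch flanked by charged residues.
    toy = "D" * 20 + "L" * 25 + "D" * 20
    print(candidate_segments(hydropathy_profile(toy)))
```

Applied to a receptor sequence, seven well-separated runs above the threshold would be consistent with the seven transmembrane domains described above, although a hydropathy plot alone is only a prediction.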

3. HMMPfam search 

[0132] Using the amino acid sequence of GPRv8 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26(1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0133] The result indicated that GPRv8 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 3.

Table 3

Hit     Score   Expect    Q from   Q to   Description
7tm_1   164.2   5.1e-51   66       330    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment

[0134] The amino acid sequences of GPRv8 and the proteins shown in Table 2 were aligned together by using Clustalw
1.7 (Figures 11 and 12). The result showed that GPRv8 comprises seven transmembrane domains (### ###) and Cys
(Cys marked with "@") participating in specific S-S bonding of GPCR.

[Example 5] Analysis of GPRv12 with bioinformatics

1. Homology search of GPRv12

[0135] The amino acid sequence of GPRv12 was analyzed by searching known sequences (the known sequence
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0,
http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0)
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402). The result showed that GPRv12 had homology
to the sequences shown in Table 4. Thus, GPRv12 was revealed to be a novel clone having homology to GPCR. The
hits with an E-value lower than e-15 are shown in Table 4.



Table 4

Hit (ID)    E-value   Identities %   Description
AF208288    8e-88     50             orphan G protein-coupled receptor GPR26 - Rattus norvegicus
L03202      2e-17     24             5-hydroxytryptamine receptor - rat
L41146      5e-17     23             5-HT6 serotonin receptor - Rattus norvegicus
S62043      2e-16     25             serotonin receptor 6 - rat
L41147      2e-16     24             5-HT6 serotonin receptor - Homo sapiens
AF134158    4e-16     23             serotonin 6 receptor - Mus musculus
L14856      4e-16     26             somatostatin receptor 4 - Human
Y14627      5e-16     21             Dopamine receptor - Cyprinus carpio
L07833      6e-16     26             somatostatin receptor 4 - Homo sapiens
AF069547    8e-16     21             putative odorant receptor LOR4 - Lampetra fluviatilis



2. Prediction of transmembrane domain

[0136] The amino acid sequence of GPRv12 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv12 had seven transmembrane domains (TM1-TM7) (Figure 13).

3. HMMPfam search

[0137] Using the amino acid sequence of GPRv12 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26(1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0138] The result indicated that GPRv12 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 5.

Table 5

Hit     Score   Expect    Q from   Q to   Description
7tm_1   74.7    7.7e-23   22       294    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment

[0139] The amino acid sequences of GPRv12 and orphan G protein-coupled receptor GPR26 - Rattus norvegicus
(AF208288) were aligned together by using Clustalw 1.7 (Figure 14). The result showed that GPRv12 comprises seven
transmembrane domains (### ###) and Cys (Cys marked with "@") participating in specific S-S bonding of GPCR.

[Example 6] Analysis of GPRv16 with bioinformatics

1. Homology search of GPRv16

[0140] The amino acid sequence of GPRv16 was analyzed by searching known sequences (the known sequence
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0,
http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0)
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402). The result showed that GPRv16 had homology
to the sequences shown in Table 6. Thus, GPRv16 was revealed to be a novel clone having homology to GPCR. The
hits with an E-value lower than e-18 are shown in Table 6.

Table 6

Hit (ID)    E-value   Identities %   Description
AF042784    4e-20     23             GALANIN RECEPTOR TYPE 2 - Mus musculus
U30290      4e-20     27             galanin receptor GALR1 - Rattus norvegicus
U90657      6e-20     27             GALANIN RECEPTOR TYPE 1 - mouse
AF042782    7e-20     25             galanin receptor type 2 - Homo sapiens
U94322      1e-19     24             galanin receptor type 2 - Rattus norvegicus
AF077375    6e-19     23             galanin receptor type 2 - Mus musculus



2. Prediction of transmembrane domain

[0141] The amino acid sequence of GPRv16 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv16 had seven transmembrane domains (TM1-TM7) (Figure 15).

3. HMMPfam search

[0142] Using the amino acid sequence of GPRv16 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26(1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0143] The result indicated that GPRv16 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 7.



Table 7

Hit     Score   Expect    Q from   Q to   Description
7tm_1   23.8    8.3e-7    155      306    7 transmembrane receptor (rhodopsin family)
7tm_1   13.3    0.0017    53       133    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.
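
When a domain is reported as several fragments, as for the two 7tm_1 rows of Table 7, the "Q from" and "Q to" columns can be combined to see how much of the query is covered. The following is a minimal sketch; the `DomainHit` type and both helper functions are hypothetical and are not part of HMMER, and the rows are copied from Table 7.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of an HMMPfam result table (hypothetical container)."""
    name: str
    score: float
    expect: float
    q_from: int
    q_to: int


# Rows copied from Table 7 (GPRv16).
TABLE_7 = [
    DomainHit("7tm_1", 23.8, 8.3e-7, 155, 306),
    DomainHit("7tm_1", 13.3, 0.0017, 53, 133),
]


def covered_residues(hits):
    """Number of query residues covered by at least one hit (inclusive ranges)."""
    covered = set()
    for hit in hits:
        covered.update(range(hit.q_from, hit.q_to + 1))
    return len(covered)


def overall_span(hits):
    """Smallest (start, end) interval on the query that contains every hit."""
    return min(h.q_from for h in hits), max(h.q_to for h in hits)


if __name__ == "__main__":
    print(overall_span(TABLE_7), covered_residues(TABLE_7))
```

For Table 7 the two fragments do not overlap, so the covered length is simply the sum of the two fragment lengths.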



4. Amino acid sequence alignment 

[0144] The results of sections 3 and 4 are indicated in Figure 16. The result showed that GPRv16 comprises Cys
residues (@) participating in the S-S bonding specific to GPCRs.

[Example 7] Analysis of GPRv21 with bioinformatics 

1. Homology search of GPRv21

[0145] The amino acid sequence of GPRv21 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv21 had homology to the sequences shown in Table 8.
Thus, GPRv21 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-35
are shown in Table 8.



Table 8

Hit (ID)    E-value   Identities %   Description
AL121755    0.0       89             G-protein coupled receptor-Human
AF236082    0.0       83             G-protein coupled receptor GPR73-Mus musculus
M81490      9e-37     34             neuropeptide receptor-D. melanogaster
U50144      3e-36     30             type 2 neuropeptide Y receptor-Bos taurus
U42766      6e-36     29             neuropeptide y2 receptor-Human
AF037444    8e-36     28             cardioexcitatory receptor-Lymnaea stagnalis
D86238      8e-36     28             neuropeptide Y-Y2 receptor-Mus musculus
U42389      8e-36     29             neuropeptide y/peptide YY receptor type 2-human
U76254      8e-36     29             neuropeptide Y receptor type 2-Human

2. Prediction of transmembrane domain

[0146] The amino acid sequence of GPRv21 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv21 had seven transmembrane domains (TM1-TM7) (Figure 17).

3. HMMPfam search

[0147] Using the amino acid sequence of GPRv21 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0148] The result indicated that GPRv21 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 9.

Table 9

Hit     Score   Expect    Q from   Q to   Description
7tm_1   188.1   1.6e-58   79       338    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment 

[0149] The amino acid sequences of GPRv21 and proteins shown in Table 8 were aligned together by using ClustalW
1.7 (Figures 18 and 19). The result showed that GPRv21 comprises seven transmembrane domains (### ###) and Cys
residues (marked with "@") participating in the S-S bonding specific to GPCRs.
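
Once two sequences have been aligned, the identity and the positions of shared Cys residues can be read directly from the aligned strings. The following is a minimal sketch that assumes the alignment is supplied as two equal-length strings with "-" as the gap character; the function names are hypothetical. Identity is computed here over gap-free columns only, which is one of several conventions and need not match the "Identities %" values reported by BLAST.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free aligned columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # columns containing a gap are not counted
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def shared_cys_columns(aligned_a, aligned_b):
    """1-based alignment columns in which both sequences carry Cys."""
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    return [i + 1 for i, (a, b) in enumerate(pairs) if a == b == "C"]


if __name__ == "__main__":
    # Toy alignment for illustration only; not taken from the figures.
    print(percent_identity("MC-LC", "ACALC"), shared_cys_columns("MC-LC", "ACALC"))
```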

[Example 8] Analysis of GPRv40 with bioinformatics 

1. Homology search of GPRv40

[0150] The amino acid sequence of GPRv40 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv40 had homology to the sequences shown in Table 10.
Thus, GPRv40 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-11
are shown in Table 10.

Table 10

Hit (ID)    E-value   Identities %   Description
D86599      1e-13     23             oxytocin receptor-Mus sp.
U15280      4e-13     23             oxytocin receptor-Rattus norvegicus
X76321      1e-12     22             vasotocin receptor-white sucker
X64878      2e-12     21             oxytocin receptor-H.sapiens
X87783      2e-12     21             isotocin receptor-C.commersoni
D45400      3e-12     23             vasopressin receptor V1b-rat
L37112      3e-12     24             vasopressin receptor subtype 1b-Homo sapiens
U27322      6e-12     23             arginine-vasopressin V1b receptor-Rattus norvegicus
U82440      6e-12     21             oxytocin receptor-Macaca mulatta



2. Prediction of transmembrane domain 

[0151] The amino acid sequence of GPRv40 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv40 had seven transmembrane domains (TM1-TM7) (Figure 20).

3. HMMPfam search 

[0152] Using the amino acid sequence of GPRv40 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0153] The result indicated that GPRv40 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 11.



Table 11

Hit     Score   Expect    Q from   Q to   Description
7tm_1   26.5    1.1e-07   228      352    7 transmembrane receptor (rhodopsin family)
7tm_1   18.1    5e-05     59       181    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment 

[0154] The results of sections 3 and 4 are indicated in Figure 21. The result showed that GPRv40 comprises Cys
residues (@) participating in the S-S bonding specific to GPCRs.

[Example 9] Analysis of GPRv47 with bioinformatics 

1. Homology search of GPRv47

[0155] The amino acid sequence of GPRv47 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv47 had homology to the sequences shown in Table 12.
Thus, GPRv47 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-11
are shown in Table 12.



Table 12

Hit (ID)      E-value       Identities %   Description
D43633        1e-85         41             G protein-coupled 7-transmembrane receptor-Medaka fish
X98133        2e-28         27             histamine H2 receptor-H.sapiens
M32701        3e-28         28             histamine H2 receptor-Canine histamine
[illegible]   [illegible]   [illegible]    5-HT6 serotonin receptor-Homo sapiens
U25440        8e-28         26             histamine H2 receptor-Cavia porcellus
D49783        1e-27         28             histamine H2 receptor-Human
U64032        2e-27         27             alpha 1d adrenoceptor-Oryctolagus cuniculus
S73473        3e-27         28             beta 3-adrenergic receptor-rats
M74716        4e-27         28             beta-adrenergic receptor-Rat
S57565        6e-27         27             histamine H2-receptor-rats



2. Prediction of transmembrane domain 

[0156] The amino acid sequence of GPRv47 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv47 had seven transmembrane domains (TM1-TM7) (Figure 22).

3. HMMPfam search 

[0157] Using the amino acid sequence of GPRv47 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0158] The result indicated that GPRv47 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 13.



Table 13

Hit     Score   Expect    Q from   Q to   Description
7tm_1   137.9   9.6e-43   59       341    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment

[0159] The amino acid sequences of GPRv47 and proteins shown in Table 2 were aligned together by using ClustalW
1.7 (Figures 23 to 25). The result showed that GPRv47 comprises seven transmembrane domains (### ###) and Cys
residues (marked with "@") participating in the S-S bonding specific to GPCRs.

[Example 10] Analysis of GPRv51 with bioinformatics 

1. Homology search of GPRv51

[0160] The amino acid sequence of GPRv51 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv51 had homology to the sequences shown in Table 14.
Thus, GPRv51 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-18
are shown in Table 14.



Table 14

Hit (ID)    E-value   Identities %   Description
M35297      4e-43     36             G-protein coupled receptor-Rat
J03823      1e-42     34             Rat mas oncogene, complete cds.
M13150      3e-40     34             mas proto-oncogene-Human
X67735      1e-39     35             Mas proto-oncogene-M.musculus mas
AL035542    1e-35     36             MAS-related G protein-coupled receptor MRG-Human

2. Prediction of transmembrane domain 

[0161] The amino acid sequence of GPRv51 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv51 had seven transmembrane domains (TM1-TM7) (Figure 26).

3. HMMPfam search 



[0162] Using the amino acid sequence of GPRv51 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0163] The result indicated that GPRv51 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 15.



Table 15

Hit     Score   Expect    Q from   Q to   Description
7tm_1   32.6    1.4e-09   44       78     7 transmembrane receptor (rhodopsin family)
7tm_1   30.1    8.7e-09   104      276    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment

[0164] The amino acid sequences of GPRv51 and G-protein coupled receptor-Rat (M35297) were aligned together
by using ClustalW 1.7 (Figure 27). The result showed that GPRv51 comprises seven transmembrane domains (### ###).

[Example 11] Analysis of GPRv71 with bioinformatics



1. Homology search of GPRv71

[0165] The amino acid sequence of GPRv71 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv71 had homology to the sequences shown in Table 16.
Thus, GPRv71 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-35
are shown in Table 16.



Table 16

Hit (ID)    E-value   Identities %   Description
AF069555    9e-44     44             G protein-coupled receptor p2y3-Meleagris gallopavo
X98283      9e-44     45             P2Y PURINOCEPTOR 3-G.domesticus
AF031897    6e-41     40             P2Y nucleotide receptor-Meleagris gallopavo
X99953      1e-39     41             P2Y PURINOCEPTOR 8-X.laevis
D63665      2e-37     41             novel G protein-coupled P2 receptor-Rat
Y14705      1e-36     40             P2Y4 receptor gene-Rattus norvegicus
AJ277752    2e-36     41             P2Y4 receptor-Mus musculus



2. Prediction of transmembrane domain 

[0166] The amino acid sequence of GPRv71 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv71 had seven transmembrane domains (TM1-TM7) (Figure 28).

3. HMMPfam search 

[0167] Using the amino acid sequence of GPRv71 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0168] The result indicated that GPRv71 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 17.



Table 17

Hit     Score   Expect    Q from   Q to   Description
7tm_1   90.6    7.6e-28   40       161    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment 



[0169] The amino acid sequences of GPRv71 and proteins shown in Table 2 were aligned together by using ClustalW
1.7 (Figures 29 and 30). The result showed that GPRv71 comprises seven transmembrane domains (### ###).

[Example 12] Analysis of GPRv72 with bioinformatics 

1. Homology search of GPRv72

[0170] The amino acid sequence of GPRv72 was analyzed by searching known sequence databases (EMBL (Release 64,
http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi.nlm.nih.gov/), and PIR (Release 66.00,
http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) (Altschul, Stephen F. et al. (1997)
Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv72 had homology to the sequences shown in Table 18.
Thus, GPRv72 was revealed to be a novel clone having homology to GPCR. The hits with an E-value lower than e-24
are shown in Table 18.



Table 18

Hit (ID)    E-value   Identities %   Description
AF091890    4e-29     32             G-protein coupled receptor RE2-Homo sapiens
U81982      3e-25     30             alpha 1a-adrenoceptor-Oryctolagus cuniculus
S71323      6e-25     32             alpha-1A adrenergic receptor-Japanese medaka
D63859      6e-25     32             alpha1A-adrenoceptor-Oryzias latipes
U07126      8e-25     29             alpha1c adrenergic receptor-Rattus norvegicus
U03866      8e-25     30             adrenergic alpha-1c receptor protein-Human
AF013261    8e-25     30             alpha 1A adrenergic receptor isoform 4-Homo sapiens
L31774      8e-25     30             alpha-1C-adrenergic receptor-Human
D32202      8e-25     30             alpha 1C adrenergic receptor isoform 2-Human
D32201      8e-25     30             alpha 1C adrenergic receptor isoform 3-Human
D25235      8e-25     30             alpha1C adrenergic receptor



2. Prediction of transmembrane domain 

[0171] The amino acid sequence of GPRv72 was analyzed according to the method of Kyte-Doolittle (J. Kyte and
R. F. Doolittle, (1982), J. Mol. Biol., 157, 105-132.) to obtain a hydropathy plot, which was used to predict the
transmembrane domains. The result showed that GPRv72 had seven transmembrane domains (TM1-TM7) (Figure 31).

3. HMMPfam search

[0172] Using the amino acid sequence of GPRv72 as the query, a PFAM search based on the hidden Markov model
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml).

[0173] The result indicated that GPRv72 comprises 7tm_1 (Rhodopsin family). The result of the HMMPfam search is
shown in Table 19.



Table 19

Hit     Score   Expect    Q from   Q to   Description
7tm_1   196.1   4.7e-61   48       454    7 transmembrane receptor (rhodopsin family)

Hit: name of the domain deduced by the search.
Score: the higher the value, the higher the reliability.
Expect: as the value approaches 0, the reliability becomes higher.
Q from: the start position of the deduced domain.
Q to: the termination position of the deduced domain.
Description: explanation of the deduced domain.



4. Amino acid sequence alignment 

[0174] The amino acid sequences of GPRv72 and proteins shown in Table 18 were aligned together by using ClustalW
1.7 (Figures 32 to 34). The result showed that GPRv72 comprises seven transmembrane domains (### ###) and Cys
residues (marked with "@") participating in the S-S bonding specific to GPCRs.

Industrial Applicability

[0175] The present invention provides novel G protein-coupled receptors (GPRv8, GPRv12, GPRv16, GPRv21,
GPRv40, GPRv47, GPRv51, GPRv71, and GPRv72), the genes encoding the proteins, vectors containing the genes,
host cells containing the vectors, and a method for producing the proteins. Further, the present invention provides a
screening method for compounds modifying the activities of the proteins. The proteins and genes of the present
invention, and compounds modifying the activity of the proteins, are expected to be used for the development of new
preventives and therapeutics for the diseases with which the G protein-coupled receptors of the present invention are
associated.

EP 1 243 648 A1 
SEQUENCE LISTING 

<110> HELIX RESEARCH INSTITUTE 

<120> NOVEL GUANOSINE TRIPHOSPHATE-BINDING PROTEIN-COUPLED RECEPTORS AND GENES
THEREOF, AND PRODUCTION AND USES THEREOF

<130> H1-113DPIPCT

<140>
<141>

<150> JP 1999-375152
<151> 1999-12-28

<150> JP 2000-101339
<151> 2000-03-31

<160> 63

<170> PatentIn Ver. 2.1



<210> 1 

<211> 371 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly
1 5 10 15

Gln Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe
20 25 30

Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe
35 40 45

Lys Thr Glu Gln Leu Ile Thr Leu Trp Val Leu Phe Val Phe Thr Ile
50 55 60

Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys
65 70 75 80

Ser Arg Met Thr Phe Phe Val Thr Gln Leu Ala Ile Thr Asp Ser Phe
85 90 95

Thr Gly Leu Val Asn Ile Leu Thr Asp Ile Asn Trp Arg Phe Thr Gly
100 105 110

Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr Leu Gln
115 120 125

Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu Ser Ile
130 135 140

Asp Arg Tyr His Ala Ile Val Tyr Pro Met Lys Phe Leu Gln Gly Glu
145 150 155 160

Lys Gln Ala Arg Val Leu Ile Val Ile Ala Trp Ser Leu Ser Phe Leu
165 170 175

Phe Ser Ile Pro Thr Leu Ile Ile Phe Gly Lys Arg Thr Leu Ser Asn
180 185 190

Gly Glu Val Gln Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr
195 200 205

Pro Tyr Met Thr Ile Val Ala Phe Leu Val Tyr Phe Ile Pro Leu Thr
210 215 220

Ile Ile Ser Ile Met Tyr Gly Ile Val Ile Arg Thr Ile Trp Ile Lys
225 230 235 240

Ser Lys Thr Tyr Glu Thr Val Ile Ser Asn Cys Ser Asp Gly Lys Leu
245 250 255

Cys Ser Ser Tyr Asn Arg Gly Leu Ile Ser Lys Ala Lys Ile Lys Ala
260 265 270

Ile Lys Tyr Ser Ile Ile Ile Ile Leu Ala Phe Ile Cys Cys Trp Ser
275 280 285

Pro Tyr Phe Leu Phe Asp Ile Leu Asp Asn Phe Asn Leu Leu Pro Asp
290 295 300

Thr Gln Glu Arg Phe Tyr Ala Ser Val Ile Ile Gln Asn Leu Pro Ala
305 310 315 320

Leu Asn Ser Ala Ile Asn Pro Leu Ile Tyr Cys Val Phe Ser Ser Ser
325 330 335

Ile Ser Phe Pro Cys Arg Glu Gln Arg Ser Gln Asp Ser Arg Met Thr
340 345 350

Phe Arg Glu Arg Thr Glu Arg His Glu Met Gln Ile Leu Ser Lys Pro
355 360 365

Glu Phe Ile
370
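
For use with analysis programs that expect one-letter codes, a three-letter listing such as the one above can be converted mechanically. The following is a minimal sketch; the function name `to_one_letter` is hypothetical, and it assumes that residues and position numbers are separated by whitespace as in this listing.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing):
    """Convert a three-letter residue listing to a one-letter string.

    Purely numeric tokens (the residue-position lines) are skipped.
    A KeyError is raised for any token that is not a standard residue code.
    """
    residues = []
    for token in listing.split():
        if token.isdigit():
            continue
        residues.append(THREE_TO_ONE[token])
    return "".join(residues)


if __name__ == "__main__":
    # First row of SEQ ID NO: 1 with its position line.
    print(to_one_letter("Met Pro Ala Asn Phe Thr Glu Gly\n1 5"))
```

Raising an error on unknown tokens, rather than skipping them, makes any token that is not a valid residue code visible immediately.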



<210> 2 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 2

Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu
1 5 10 15

Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr
20 25 30

Ser Ala Glu Leu Arg Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu
35 40 45

Ser Leu Gly His Leu Leu Leu Ala Ala Leu Asp Met Pro Phe Thr Leu
50 55 60

Leu Gly Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gln
65 70 75 80

Val Ile Gly Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser
85 90 95

Val Ala Ala Leu Ser Ala Asp Gln Trp Leu Ala Val Gly Phe Pro Leu
100 105 110

Arg Tyr Ala Gly Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu Leu Gly
115 120 125

Cys Ala Trp Gly Gln Ser Leu Ala Phe Ser Gly Ala Ala Leu Gly Cys
130 135 140

Ser Trp Leu Gly Tyr Ser Ser Ala Phe Ala Ser Cys Ser Leu Arg Leu
145 150 155 160

Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala Thr Leu
165 170 175

His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser
180 185 190

Leu Gln Val His Arg Val Ala Arg Arg His Cys Gln Arg Met Asp Thr
195 200 205

Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val
210 215 220

Arg Gln Arg Cys Leu Ile Gln Gln Lys Arg Arg Arg His Arg Ala Thr
225 230 235 240

Arg Lys Ile Gly Ile Ala Ile Ala Thr Phe Leu Ile Cys Phe Ala Pro
245 250 255

Tyr Val Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn
260 265 270

Ala Gln Trp Gly Ile Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val
275 280 285

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arg Gln Val
290 295 300

Leu Ala Gly Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala
305 310 315 320

Ser Thr His Asp Ser Ser Leu Asp Val Ala Gly Met Val His Gln Leu
325 330 335

Leu Lys Arg Thr Pro Arg Pro Ala Ser Thr His Asn Gly Ser Val Asp
340 345 350

Thr Glu Asn Asp Ser Cys Leu Gln Gln Thr His
355 360


<210> 3
<211> 419
<212> PRT

<213> Homo sapiens

<400> 3

Met Leu Ala Ala Ala Phe Ala Asp Ser Asn Ser Ser Ser Met Asn Val
1 5 10 15

Ser Phe Ala His Leu His Phe Ala Gly Gly Tyr Leu Pro Ser Asp Ser
20 25 30

Gln Asp Trp Arg Thr Ile Ile Pro Ala Leu Leu Val Ala Val Cys Leu
35 40 45

Val Gly Phe Val Gly Asn Leu Cys Val Ile Gly Ile Leu Leu His Asn
50 55 60

Ala Trp Lys Gly Lys Pro Ser Met Ile His Ser Leu Ile Leu Asn Leu
65 70 75 80

Ser Leu Ala Asp Leu Ser Leu Leu Leu Phe Ser Ala Pro Ile Arg Ala
85 90 95

Thr Ala Tyr Ser Lys Ser Val Trp Asp Leu Gly Trp Phe Val Cys Lys
100 105 110

Ser Ser Asp Trp Phe Ile His Thr Cys Met Ala Ala Lys Ser Leu Thr
115 120 125

Ile Val Val Val Ala Lys Val Cys Phe Met Tyr Ala Ser Asp Pro Ala
130 135 140

Lys Gln Val Ser Ile His Asn Tyr Thr Ile Trp Ser Val Leu Val Ala
145 150 155 160

Ile Trp Thr Val Ala Ser Leu Leu Pro Leu Pro Glu Trp Phe Phe Ser
165 170 175

Thr Ile Arg His His Glu Gly Val Glu Met Cys Leu Val Asp Val Pro
180 185 190

Ala Val Ala Glu Glu Phe Met Ser Met Phe Gly Lys Leu Tyr Pro Leu
195 200 205

Leu Ala Phe Gly Leu Pro Leu Phe Phe Ala Ser Phe Tyr Phe Trp Arg
210 215 220

Ala Tyr Asp Gln Cys Lys Lys Arg Gly Thr Lys Thr Gln Asn Leu Arg
225 230 235 240

Asn Gln Ile Arg Ser Lys Gln Val Thr Val Met Leu Leu Ser Ile Ala
245 250 255

Ile Ile Ser Ala Val Leu Trp Leu Pro Glu Trp Val Ala Trp Leu Trp
260 265 270

Val Trp His Leu Lys Ala Ala Gly Pro Ala Pro Pro Gln Gly Phe Ile
275 280 285

Ala Leu Ser Gln Val Leu Met Phe Ser Ile Ser Ser Ala Asn Pro Leu
290 295 300

Ile Phe Leu Val Met Ser Glu Glu Phe Arg Glu Gly Leu Lys Gly Val
305 310 315 320

Trp Lys Trp Met Ile Thr Lys Lys Pro Pro Thr Val Ser Glu Ser Gln
325 330 335

Glu Thr Pro Ala Gly Asn Ser Glu Gly Leu Pro Asp Lys Val Pro Ser
340 345 350

Pro Glu Ser Pro Ala Ser Ile Pro Glu Lys Glu Lys Pro Ser Ser Pro
355 360 365

Ser Ser Gly Lys Gly Lys Thr Glu Lys Ala Glu Ile Pro Ile Leu Pro
370 375 380

Asp Val Glu Gln Phe Trp His Glu Arg Asp Thr Val Pro Ser Val Gln
385 390 395 400

Asp Asn Asp Pro Ile Pro Trp Glu His Glu Asp Gln Glu Thr Gly Glu
405 410 415

Gly Val Lys



<210> 4 
<211> 393 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Glu Thr Thr Met Gly Phe Met Asp Asp Asn Ala Thr Asn Thr Ser
1 5 10 15

Thr Ser Phe Leu Ser Val Leu Asn Pro His Gly Ala His Ala Thr Ser
20 25 30

Phe Pro Phe Asn Phe Ser Tyr Ser Asp Tyr Asp Met Pro Leu Asp Glu
35 40 45

Asp Glu Asp Val Thr Asn Ser Arg Thr Phe Phe Ala Ala Lys Ile Val
50 55 60

Ile Gly Met Ala Leu Val Gly Ile Met Leu Val Cys Gly Ile Gly Asn
65 70 75 80

Phe Ile Phe Ile Ala Ala Leu Val Arg Tyr Lys Lys Leu Arg Asn Leu
85 90 95

Thr Asn Leu Leu Ile Ala Asn Leu Ala Ile Ser Asp Phe Leu Val Ala
100 105 110

Ile Val Cys Cys Pro Phe Glu Met Asp Tyr Tyr Val Val Arg Gln Leu
115 120 125

Ser Trp Glu His Gly His Val Leu Cys Thr Ser Val Asn Tyr Leu Arg
130 135 140

Thr Val Ser Leu Tyr Val Ser Thr Asn Ala Leu Leu Ala Ile Ala Ile
145 150 155 160

Asp Arg Tyr Leu Ala Ile Val His Pro Leu Arg Pro Arg Met Lys Cys
165 170 175

Gln Thr Ala Thr Gly Leu Ile Ala Leu Val Trp Thr Val Ser Ile Leu
180 185 190

Ile Ala Ile Pro Ser Ala Tyr Phe Thr Thr Glu Thr Val Leu Val Ile
195 200 205

Val Lys Ser Gln Glu Lys Ile Phe Cys Gly Gln Ile Trp Pro Val Asp
210 215 220

Gln Gln Leu Tyr Tyr Lys Ser Tyr Phe Leu Phe Ile Phe Gly Ile Glu
225 230 235 240

Phe Val Gly Pro Val Val Thr Met Thr Leu Cys Tyr Ala Arg Ile Ser
245 250 255

Arg Glu Leu Trp Phe Lys Ala Val Pro Gly Phe Gln Thr Glu Gln Ile
260 265 270

Arg Lys Arg Leu Arg Cys Arg Arg Lys Thr Val Leu Val Leu Met Cys
275 280 285

Ile Leu Thr Ala Tyr Val Leu Cys Trp Ala Pro Phe Tyr Gly Phe Thr
290 295 300

Ile Val Arg Asp Phe Phe Pro Thr Val Phe Val Lys Glu Lys His Tyr
305 310 315 320

Leu Thr Ala Phe Tyr Ile Val Glu Cys Ile Ala Met Ser Asn Ser Met
325 330 335

Ile Asn Thr Leu Cys Phe Val Thr Val Lys Asn Asp Thr Val Lys Tyr
340 345 350

Phe Lys Lys Ile Met Leu Leu His Trp Lys Ala Ser Tyr Asn Gly Gly
355 360 365

Lys Ser Ser Ala Asp Leu Asp Leu Lys Thr Ile Gly Met Pro Ala Thr
370 375 380

Glu Glu Val Asp Cys Ile Arg Leu Lys
385 390



<210> 5 
<211> 1116 
<212> DNA

<213> Homo sapiens 
<400> 5 

atgccagcca acttcacaga gggcagcttc gattccagtg ggaccgggca gacgctggat 60 
tcttccccag tggcttgcac tgaaacagtg acttttactg aagtggtgga aggaaaggaa 120 
tggggttcct tctactactc ctttaagact gagcaattga taactctgtg ggtcctcttt 180 
gtttttacca ttgttggaaa ctccgttgtg cttttttcca catggaggag aaagaagaag 240 
tcaagaatga ccttctttgt gactcagctg gccatcacag attctttcac aggactggtc 300 
aacatcttga cagatattaa ttggcgattc actggagact tcacggcacc tgacctggtt 360 
tgccgagtgg tccgctattt gcaggttgtg ctgctctacg cctctaccta cgtcctggtg 420
tccctcagca tagacagata ccatgccatc gtctacccca tgaagttcct tcaaggagaa 480 
aagcaagcca gggtcctcat tgtgatcgcc tggagcctgt cttttctgtt ctccattccc 540 
accctgatca tatttgggaa gaggacactg tccaacggtg aagtgcagtg ctgggccctg 600 
tggcctgacg actcctactg gaccccatac atgaccatcg tggccttcct ggtgtacttc 660 
atccctctga caatcatcag catcatgtat ggcattgtga tccgaactat ttggattaaa 720 
agcaaaacct acgaaacagt gatttccaac tgctcagatg ggaaactgtg cagcagctat 780 
aaccgaggac tcatctcaaa ggcaaaaatc aaggctatca agtatagcat catcatcatt 840 
cttgccttca tctgctgttg gagtccatac ttcctgtttg acattttgga caatttcaac 900 
ctccttccag acacccagga gcgtttctat gcctctgtga tcattcagaa cctgccagca 960 
ttgaatagtg ccatcaaccc cctcatctac tgtgtcttca gcagctccat ctctttcccc 1020 
tgcagggagc aaagatcaca ggattccaga atgacgttcc gggagagaac tgagaggcat 1080 
gagatgcaga ttctgtccaa gccagaattc atctag 1116 

<210> 6 
<211> 1092 
<212> DNA

<213> Homo sapiens 
<400> 6 

atgggccccg gcgaggcgct gctggcgggt ctcctggtga tggtactggc cgtggcgctg 60 
ctatccaacg cactggtgct gctttgttgc gcctacagcg ctgagctccg cactcgagcc 120 
tcaggcgtcc tcctggtgaa tctgtctctg ggccacctgc tgctggcggc gctggacatg 180 
cccttcacgc tgctcggtgt gatgcgcggg cggacaccgt cggcgcccgg cgcatgccaa 240 
gtcattggct tcctggacac cttcctggcg tccaacgcgg cgctgagcgt ggcggcgctg 300 
agcgcagacc agtggctggc agtgggcttc ccactgcgct acgccggacg cctgcgaccg 360 
cgctatgccg gcctgctgct gggctgtgcc tggggacagt cgctggcctt ctcaggcgct 420 
gcacttggct gctcgtggct tggctacagc agcgccttcg cgtcctgttc gctgcgcctg 480 
ccgcccgagc ctgagcgtcc gcgcttcgca gccttcaccg ccacgctcca tgccgtgggc 540 
ttcgtgctgc cgctggcggt gctctgcctc acctcgctcc aggtgcaccg ggtggcacgc 600 
agacactgcc agcgcatgga caccgtcacc atgaaggcgc tcgcgctgct cgccgacctg 660 
caccccagtg tgcggcagcg ctgcctcatc cagcagaagc ggcgccgcca ccgcgccacc 720 
aggaagattg gcattgctat tgcgaccttc ctcatctgct ttgccccgta tgtcatgacc 780 
aggctggcgg agctcgtgcc cttcgtcacc gtgaacgccc agtggggcat cctcagcaag 840 
tgcctgacct acagcaaggc ggtggccgac ccgttcacgt actctctgct ccgccggccg 900 
ttccgccaag tcctggccgg catggtgcac cggctgctga agagaacccc gcgcccagca 960
tccacccatg acagctctct ggatgtggcc ggcatggtgc accagctgct gaagagaacc 1020 
ccgcgcccag cgtccaccca caacggctct gtggacacag agaatgattc ctgcctgcag 1080 
cagacacact ga 1092 

<210> 7 

<211> 1260 

<212> DNA 

<213> Homo sapiens 

<400> 7 

atgctggcag ctgcctttgc agactctaac tccagcagca tgaatgtgtc ctttgctcac 60 
ctccactttg ccggagggta cctgccctct gattcccagg actggagaac catcatcccg 120 
gctctcttgg tggctgtctg cctggtgggc ttcgtgggaa acctgtgtgt gattggcatc 180 
ctccttcaca atgcttggaa aggaaagcca tccatgatcc actccctgat tctgaatctc 240 
agcctggctg atctctccct cctgctgttt tctgcaccta tccgagctac ggcgtactcc 300 
aaaagtgttt gggatctagg ctggtttgtc tgcaagtcct ctgactggtt tatccacaca 360 
tgcatggcag ccaagagcct gacaatcgtt gtgttggcca aagtatgctt catgtatgca 420
agtgacccag ccaagcaagt gagtatccac aactacacca tctggtcagt gctggtggcc 480 
atctggactg tggctagcct gttacccctg ccggaatggt tctttagcac catcaggcat 540 
oatgaaggtg Tggaaaltgtg 600 
atgtttggta agctctaccc actcctggca tttggccttc cattattttt tgccagcttt 660 
tatttctgga gagcttatga ccaatgtaaa aaacgaggaa ctaagactca aaatcttaga 720 
aaccagatac gctcaaagca agtcacagtg atgctgctga gcattgccat catctctgct 780 
gtcttgtggc tccccgaatg ggtagcttgg ctgtgggtat ggcatctgaa ggctgcaggc 840 
ccggccccac cacaaggttt catagccctg tctcaagtct tgatgttttc catctcttca 900 
gcaaatcctc tcatttttct tgtgatgtcg gaagagttca gggaaggctt gaaaggtgta 960 
tggaaatgga tgataaccaa aaaacctcca actgtctcag agtctcagga aacaccagct 1020 
ggcaactcag agggtcttcc tgacaaggtt ccatctccag aatccccagc atccatacca 1080 
gaaaaagaga aacccagctc tccctcctct ggcaaaggga aaactgagaa ggcagagatt 1140 
cccatccttc ctgacgtaga gcagttttgg catgagaggg acacagtccc ttctgtacag 1200 
gacaatgacc ctatcccctg ggaacatgaa gatcaagaga caggggaagg tgttaaatag 1260 



<210> 8 
<211> 1182
<212> DNA 

<213> Homo sapiens 
<400> 8 

atggagacca ccatggggtt catggatgac aatgccacca acacttccac cagcttcctt 60 
tctgtgctca accctcatgg agcccatgcc acttccttcc cattcaactt cagctacagc 120 
gactatgata tgcctttgga tgaagatgag gatgtgacca attccaggac gttctttgct 180 
gccaagattg tcattgggat ggccctggtg ggcatcatgc tggtctgcgg cattggaaac 240 
ttcatcttta tcgctgccct ggtccgctac aagaaactgc gcaacctcac caacctgctc 300 
atcgccaacc tggccatctc tgacttcctg gtggccattg tctgctgccc ctttgagatg 360 
gactactatg tggtgcgcca gctctcctgg gagcacggcc acgtcctgtg cacctctgtc 420 
aactacctgc gcactgtctc tctctatgtc tccaccaatg ccctgctggc catcgccatt 480 
gacaggtatc tggctattgt ccatccgctg agaccacgga tgaagtgcca aacagccact 540 
ggcctgattg ccttggtgtg gacggtgtcc atcctgatcg ccatcccttc cgcctacttc 600 
accaccgaga cggtcctcgt cattgtcaag agccaggaaa agatcttctg cggccagatc 660 
tggcctgtgg accagcagct ctactacaag tcctacttcc tctttatctt tggcatagaa 720 
ttcgtgggcc ccgtggtcac catgaccctg tgctatgcca ggatctcccg ggagctctgg 780 
ttcaaggcgg tccctggatt ccagacagag cagatccgca agaggctgcg ctgccgcagg 840 
aagacggtcc tggtgctcat gtgcatcctc accgcctacg tgctatgctg ggcgcccttc 900 
tacggcttca ccatcgtgcg cgacttcttc cccaccgtgt ttgtgaagga gaagcactac 960 
ctcactgcct tctacatcgt cgagtgcatc gccatgagca acagcatgat caacactctg 1020 
tgcttcgtga ccgtcaagaa cgacaccgtc aagtacttca aaaagatcat gttgctccac 1080 
tggaaggctt cttacaatgg cggtaagtcc agtgcagacc tggacctcaa gacaattggg 1140 
atgcctgcca ccgaagaggt ggactgcatc agactaaaat aa 1182 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence
<400> 9

atgccagcca acttcacaga gggcagct 

<210> 10 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 10 

ctagatgaat tctggcttgg acagaatc 

<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 11 

atgggccccg gcgaggcgct gctggcgg 

<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 12 

tcagtgtgtc tgctgcaggc aggaatca 

<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 13 

atgctggcag ctgcctttgc agactctaac 

<210> 14 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence

<400> 14

ctatttaaca ccttcccctg tctcttgatc 

<210> 15 
<211> 28 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 15 

atggagacca ccatggggtt catggatg 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 16
ttattttagt ctgatgcagt ccacctcttc 

<210> 17 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Glu Asp Leu Phe Ser Pro Ser Ile Leu Pro Pro Ala Pro Asn Ile
1 5 10 15

Ser Val Pro Ile Leu Leu Gly Trp Gly Leu Asn Leu Thr Leu Gly Gln
20 25 30

Gly Ala Pro Ala Ser Gly Pro Pro Ser Arg Arg Val Arg Leu Val Phe
35 40 45

Leu Gly Val Ile Leu Val Val Ala Val Ala Gly Asn Thr Thr Val Leu
50 55 60

Cys Arg Leu Cys Gly Gly Gly Gly Pro Trp Ala Gly Pro Lys Arg Arg
65 70 75 80

Lys Met Asp Phe Leu Leu Val Gln Leu Ala Leu Ala Asp Leu Tyr Ala
85 90 95

Cys Gly Gly Thr Ala Leu Ser Gln Leu Ala Trp Glu Leu Leu Gly Glu
100 105 110

Pro Arg Ala Ala Thr Gly Asp Leu Ala Cys Arg Phe Leu Gln Leu Leu
115 120 125

Gln Ala Ser Gly Arg Gly Ala Ser Ala His Leu Val Val Leu Ile Ala
130 135 140

Leu Glu Arg Arg Arg Ala Val Arg Leu Pro His Gly Arg Pro Leu Pro
145 150 155 160

Ala Arg Ala Leu Ala Ala Leu Gly Trp Leu Leu Ala Leu Leu Leu Ala
165 170 175

Leu Pro Pro Ala Phe Val Val Arg Gly Asp Ser Pro Ser Pro Leu Pro
180 185 190

Pro Pro Pro Pro Pro Thr Ser Leu Gln Pro Gly Ala Pro Pro Ala Ala
195 200 205

Arg Ala Trp Pro Gly Gln Arg Arg Cys His Gly Ile Phe Ala Pro Leu
210 215 220

Pro Arg Trp His Leu Gln Val Tyr Ala Phe Tyr Glu Ala Val Ala Gly
225 230 235 240

Phe Val Ala Pro Val Thr Val Leu Gly Val Ala Cys Gly His Leu Leu
245 250 255

Ser Val Trp Trp Arg His Arg Pro Gln Ala Pro Ala Ala Ala Ala Pro
260 265 270

Trp Ser Ala Ser Pro Gly Arg Ala Pro Ala Pro Ser Ala Leu Pro Arg
275 280 285

Ala Lys Val Gln Ser Leu Lys Met Ser Leu Leu Leu Ala Leu Leu Phe
290 295 300

Val Gly Cys Glu Leu Pro Tyr Phe Ala Ala Arg Leu Ala Ala Ala Trp
305 310 315 320

Ser Ser Gly Pro Ala Gly Asp Trp Glu Gly Glu Gly Leu Ser Ala Ala
325 330 335

Leu Arg Val Val Ala Met Ala Asn Ser Ala Leu Asn Pro Phe Val Tyr
340 345 350

Leu Phe Phe Gln Ala Gly Asp Cys Arg Leu Arg Arg Gln Leu Arg Lys
355 360 365

Arg Leu Gly Ser Leu Cys Cys Ala Pro Gln Gly Gly Ala Glu Asp Glu
370 375 380

Glu Gly Pro Arg Gly His Gln Ala Leu Tyr Arg Gln Arg Trp Pro His
385 390 395 400

Pro His Tyr His His Ala Arg Arg Glu Pro Leu Asp Glu Gly Gly Leu
405 410 415

Arg Pro Pro Pro Pro Arg Pro Arg Pro Leu Pro Cys Ser Cys Glu Ser
420 425 430

Ala Phe



<210> 18 
<211> 451 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Glu Ser Ser Pro Ile Pro Gln Ser Ser Gly Asn Ser Ser Thr Leu
1 5 10 15

Gly Arg Val Pro Gln Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro
20 25 30

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Phe
35 40 45

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met
50 55 60

Ala Val Ile Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe
65 70 75 80

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu
85 90 95

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu
100 105 110

Val Ala Cys Arg Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu
115 120 125

Ala Ile Leu Ser Val Ser Ala Ile Asn Val Glu Arg Tyr Tyr Tyr Val
130 135 140

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala
145 150 155 160

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val
165 170 175

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro
180 185 190

Pro Gly Cys Ser Leu Gln Trp Ser His Ser Ala Tyr Cys Gln Leu Phe
195 200 205

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu Ile
210 215 220

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met
225 230 235 240

Gln His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gln Arg Ser
245 250 255

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro
260 265 270

Gln Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val
275 280 285

Leu Leu Ala Val Gly Gly Gln Phe Leu Leu Cys Trp Leu Pro Tyr Phe
290 295 300

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gln Pro Ile Ser Thr Gly
305 310 315 320

Gln Val Glu Ser Val Val Thr Trp Ile Gly Tyr Phe Cys Phe Thr Ser
325 330 335

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gln Ile Arg Gly Glu Leu
340 345 350

Ser Lys Gln Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu
355 360 365

Arg Leu Pro Ser Arg Glu Gly Ser Ile Glu Glu Asn Phe Leu Gln Phe
370 375 380

Leu Gln Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pro Leu
385 390 395 400

Pro Ser Pro Lys Gln Glu Pro Pro Ala Val Asp Phe Arg Ile Pro Gly
405 410 415

Gln Ile Ala Glu Glu Thr Ser Glu Phe Leu Glu Gln Gln Leu Thr Ser
420 425 430

Asp Ile Ile Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg
435 440 445

Leu Glu Ser
450



<210> 19 
<211> 321 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Met Asn Gln Thr Leu Asn Ser Ser Gly Thr Val Glu Ser Ala Leu Asn
1 5 10 15

Tyr Ser Arg Gly Ser Thr Val His Thr Ala Tyr Leu Val Leu Ser Ser
20 25 30

Leu Ala Met Phe Thr Cys Leu Cys Gly Met Ala Gly Asn Ser Met Val
35 40 45

Ile Trp Leu Leu Gly Phe Arg Met His Arg Asn Pro Phe Cys Ile Tyr
50 55 60

Ile Leu Asn Leu Ala Ala Ala Asp Leu Leu Phe Leu Phe Ser Met Ala
65 70 75 80

Ser Thr Leu Ser Leu Glu Thr Gln Pro Leu Val Asn Thr Thr Asp Lys
85 90 95

Val His Glu Leu Met Lys Arg Leu Met Tyr Phe Ala Tyr Thr Val Gly
100 105 110

Leu Ser Leu Leu Thr Ala Ile Ser Thr Gln Arg Cys Leu Ser Val Leu
115 120 125

Phe Pro Ile Trp Phe Lys Cys His Arg Pro Arg His Leu Ser Ala Trp
130 135 140

Val Cys Gly Leu Leu Trp Thr Leu Cys Leu Leu Met Asn Gly Leu Thr
145 150 155 160

Ser Ser Phe Cys Ser Lys Phe Leu Lys Phe Asn Glu Asp Arg Cys Phe
165 170 175

Arg Val Asp Met Val Gln Ala Ala Leu Ile Met Gly Val Leu Thr Pro
180 185 190

Val Met Thr Leu Ser Ser Leu Thr Leu Phe Val Trp Val Arg Arg Ser
195 200 205

Ser Gln Gln Trp Arg Arg Gln Pro Thr Arg Leu Phe Val Val Val Leu
210 215 220

Ala Ser Val Leu Val Phe Leu Ile Cys Ser Leu Pro Leu Ser Ile Tyr
225 230 235 240

Trp Phe Val Leu Tyr Trp Leu Ser Leu Pro Pro Glu Met Gln Val Leu
245 250 255

Cys Phe Ser Leu Ser Arg Leu Ser Ser Ser Val Ser Ser Ser Ala Asn
260 265 270

Pro Val Ile Tyr Phe Leu Val Gly Ser Arg Arg Ser His Arg Leu Pro
275 280 285

Thr Arg Ser Leu Gly Thr Val Leu Gln Gln Ala Leu Arg Glu Glu Pro
290 295 300

Glu Leu Glu Gly Gly Glu Thr Pro Thr Val Gly Thr Asn Glu Met Gly
305 310 315 320

Ala



<210> 20 

<211> 333 

<212> PRT 

<213> Homo sapiens 



<400> 20 

Met Glu Lys Val Asp Met Asn Thr Ser Gln Glu Gln Gly Leu Cys Gln
1 5 10 15

Phe Ser Glu Lys Tyr Lys Gln Val Tyr Leu Ser Leu Ala Tyr Ser Ile
20 25 30

Ile Phe Ile Leu Gly Leu Pro Leu Asn Gly Thr Val Leu Trp His Phe
35 40 45

Trp Gly Gln Thr Lys Arg Trp Ser Cys Ala Thr Thr Tyr Leu Val Asn
50 55 60

Leu Met Val Ala Asp Leu Leu Tyr Val Leu Leu Pro Phe Leu Ile Ile
65 70 75 80

Thr Tyr Ser Leu Asp Asp Arg Trp Pro Phe Gly Glu Leu Leu Cys Lys
85 90 95

Leu Val His Phe Leu Phe Tyr Ile Asn Leu Tyr Gly Ser Ile Leu Leu
100 105 110

Leu Thr Cys Ile Ser Val His Gln Phe Leu Gly Val Cys His Pro Leu
115 120 125

Cys Ser Leu Pro Tyr Arg Thr Arg Arg His Ala Trp Leu Gly Thr Ser
130 135 140

Thr Thr Trp Ala Leu Val Val Leu Gln Leu Leu Pro Thr Leu Ala Phe
145 150 155 160

Ser His Thr Asp Tyr Ile Asn Gly Gln Met Ile Trp Tyr Asp Met Thr
165 170 175

Ser Gln Glu Asn Phe Asp Arg Leu Phe Ala Tyr Gly Ile Val Leu Thr
180 185 190

Leu Ser Gly Phe Leu Ser Leu Leu Gly His Phe Gly Val Leu Phe Thr
195 200 205

Asp Gly Gln Glu Pro Asp Gln Ala Arg Gly Glu Pro His Glu Asp Arg
210 215 220

Gln His Ser Pro Ser Gln Val His Pro Asp His Pro Thr Gly Val Trp
225 230 235 240

Pro Leu His Pro Leu Phe Cys Ala Leu Pro Tyr His Ser Leu Leu Leu
245 250 255

Pro His His Leu Leu Ser Ala Phe Ser Gly Leu Pro Ala Leu Asp Gly
260 265 270

Ser Gln Cys Gly Leu Gln Asp Met Glu Ala Ser Gly Glu Cys Glu Gln
275 280 285

Leu Pro Gln Pro Ser Pro Val Leu Ser Phe Lys Gly Gly Lys Asn Arg
290 295 300

Val Arg Leu Leu Gln Lys Leu Arg Gln Asn Lys Leu Gly Glu His Pro
305 310 315 320

Ala Gly Arg Lys Arg Cys Pro Gly Leu Asn Arg Ser Gly
325 330



<210> 21 

<211> 508 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His
1 5 10 15

Thr Cys Met Pro Leu Ser Lys Met Pro Ile Ser Leu Ala His Gly Ile
20 25 30

Ile Arg Ser Thr Val Leu Val Ile Phe Leu Ala Ala Ser Phe Val Gly
35 40 45

Asn Ile Val Leu Ala Leu Val Leu Gln Arg Lys Pro Gln Leu Leu Gln
50 55 60

Val Thr Asn Arg Phe Ile Phe Asn Leu Leu Val Thr Asp Leu Leu Gln
65 70 75 80

Ile Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe
85 90 95

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His
100 105 110

Leu Phe Ala Phe Ala Ser Val Asn Thr Ile Val Val Val Ser Val Asp
115 120 125

Arg Tyr Leu Ser Ile Ile His Pro Leu Ser Tyr Pro Ser Lys Met Thr
130 135 140

Gln Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp Ile Val Ala Ile
145 150 155 160

Leu Gln Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gln Ala Ala Phe Asp
165 170 175

Glu Arg Asn Ala Leu Cys Ser Met Ile Trp Gly Ala Ser Pro Ser Tyr
180 185 190

Thr Ile Leu Ser Val Val Ser Phe Ile Val Ile Pro Leu Ile Val Met
195 200 205

Ile Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gln His Ala
210 215 220

Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp
225 230 235 240

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe
245 250 255

Gln Asp Glu Ser Glu Phe Arg Arg Gln His Glu Gly Glu Val Lys Ala
260 265 270

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu
275 280 285

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu
290 295 300

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly
305 310 315 320

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys
325 330 335

Gly Arg Thr Glu Val Asn Gln Cys Ser Ile Asp Leu Gly Glu Asp Asp
340 345 350

Met Glu Phe Gly Glu Asp Asp Ile Asn Phe Ser Glu Asp Asp Val Glu
355 360 365

Ala Val Asn Ile Pro Glu Ser Leu Pro Pro Ser Arg Arg Asn Ser Asn
370 375 380

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gln Cys Lys Ala Ala Lys Val
385 390 395 400

Ile Phe Ile Ile Ile Phe Ser Tyr Val Leu Ser Leu Gly Pro Tyr Cys
405 410 415

Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gln Val Pro
420 425 430

Gln Trp Val Ile Thr Ile Ile Ile Trp Leu Phe Phe Leu Gln Cys Cys
435 440 445

Ile His Pro Tyr Val Tyr Gly Tyr Met His Lys Thr Ile Lys Lys Glu
450 455 460

Ile Gln Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys
465 470 475 480

Glu Asp Ser His Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly
485 490 495

Lys Ile Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro
500 505



<210> 22 
<211> 1305 
<212> DNA 



<213> Homo sapiens
<400> 22

atggaggatc tctttagccc ctcaattctg ccgccggcgc ccaacatttc cgtgcccatc 60
ttgctgggct ggggtctcaa cctgaccttg gggcaaggag cccctgcctc tgggccgccc 120
agccgccgcg tccgcctggt gttcctgggg gtcatcctgg tggtggcggt ggcaggcaac 180
accacagtgc tgtgccgcct gtgcggcggc ggcgggccct gggcgggccc caagcgtcgc 240
aagatggact tcctgctggt gcagctggcc ctggcggacc tgtacgcgtg cgggggcacg 300
gcgctgtcac agctggcctg ggaactgctg ggcgagcccc gcgcggccac gggggacctg 360
gcgtgccgct tcctgcagct gctgcaggca tccgggcggg gcgcctcggc ccacctcgtg 420
gtgctcatcg ccctcgagcg ccggcgcgcg gtgcgtcttc cgcacggccg gccgctgccc 480
gcgcgtgccc tcgccgccct gggctggctg ctggcactgc tgctggcgct gcccccggcc 540
ttcgtggtgc gcggggactc cccctcgccg ctgccgccgc cgccgccgcc aacgtccctg 600
cagccaggcg cgcccccggc cgcccgcgcc tggccggggc agcgtcgctg ccacgggatc 660
ttcgcgcccc tgccgcgctg gcacctgcag gtctacgcgt tctacgaggc cgtcgcgggc 720

ttcgtcgcgc ctgttacggt cctgggcgtc gcttgcggcc acctactctc cgtctggtgg 780 

cggcaccggc cgcaggcccc cgcggctgca gcgccctggt cggcgagccc aggtcgagcc 840 

cctgcgccca gcgcgctgcc ccgcgccaag gtgcagagcc tgaagatgag cctgctgctg 900 

gcgctgctgt tcgtgggctg cgagctgccc tactttgccg cccggctggc ggccgcgtgg 960 

tcgtccgggc ccgcgggaga ctgggaggga gagggcctgt cggcggcgct gcgcgtggtg 1020 

gcgatggcca acagcgctct caatcccttc gtctacctct tcttccaggc gggcgactgc 1080 

cggctccggc gacagctgcg gaagcggctg ggctctctgt gctgcgcgcc gcagggaggc 1140 

gcggaggacg aggaggggcc ccggggccac caggcgctct accgccaacg ctggccccac 1200 

cctcattatc accatgctcg gcgggaaccg ctggacgagg gcggcttgcg cccaccccct 1260 

ccgcgcccca gacccctgcc ttgctcctgc gaaagtgcct tctag 1305 

<210> 23 

<211> 1356 

<212> DNA 

<213> Homo sapiens 



<400> 23 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60
caaaccccag gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 120
tcggaatctg tggccctctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 180
gccgctgtga tggccgtgat cgccaagacg cctgccctcc gaaaatttgt cttcgtcttc 240
cacctctgcc tggtggacct gctggctgcc ctgaccctca tgcccctggc catgctctcc 300
agctctgccc tctttgacca cgccctcttt ggggaggtgg cctgccgcct ctacttgttt 360
ctgagcgtgt gctttgtcag cctggccatc ctctcggtgt cagccatcaa tgtggagcgc 420
tactattacg tagtccaccc catgcgctac gaggtgcgca tgacgctggg gctggtggcc 480
tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540
agggtctcct gggaggaagg agctcccagt gtccccccag gctgttcact ccagtggagc 600
cacagtgcct actgccagct ttttgtggtg gtctttgctg tcctttactt tctgttgccc 660
ctgctcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720
cagcacgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc 780
agccgctcca cgatggtcac cagctcgggg gccccccaga ccaccccaca ccggacgttt 840
gggggaggga aagcagcagt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900
ttgccctact tctctttcca cctctatgtt gccctgagtg ctcagcccat ttcaactggg 960
caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 1020
tatggatgtc tcaaccggca gatccggggg gagctcagca agcagtttgt ctgcttcttc 1080
aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1140
ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200
cccagcccca agcaggagcc acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260
gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320
ctccgtcctg ccgcctcacc ccggctggag tcatga 1356

<210> 24
<211> 966
<212> DNA

<213> Homo sapiens
<400> 24

atgaaccaga ctttgaatag cagtgggacc gtggagtcag ccctaaacta ttccagaggg 60
agcacagtgc acacggccta cctggtgctg agctccctgg ccatgttcac ctgcctgtgc 120
gggatggcag gcaacagcat ggtgatctgg ctgctgggct ttcgaatgca caggaacccc 180
ttctgcatct atatcctcaa cctggcggca gccgacctcc tcttcctctt cagcatggct 240
tccacgctca gcctggaaac ccagcccctg gtcaatacca ctgacaaggt ccacgagctg 300
atgaagagac tgatgtactt tgcctacaca gtgggcctga gcctgctgac ggccatcagc 360
acccagcgct gtctctctgt cctcttccct atctggttca agtgtcaccg gcccaggcac 420
ctgtcagcct gggtgtgtgg cctgctgtgg acactctgtc tcctgatgaa cgggttgacc 480
tcttccttct gcagcaagtt cttgaaattc aatgaagatc ggtgcttcag ggtggacatg 540
gtccaggccg ccctcatcat gggggtctta accccagtga tgactctgtc cagcctgacc 600
ctctttgtct gggtgcggag gagctcccag cagtggcggc ggcagcccac acggctgttc 660
gtggtggtcc tggcctctgt cctggtgttc ctcatctgtt ccctgcctct gagcatctac 720
tggtttgtgc tctactggtt gagcctgccg cccgagatgc aggtcctgtg cttcagcttg 780
tcacgcctct cctcgtccgt aagcagcagc gccaaccccg tcatctactt cctggtgggc 840
agccggagga gccacaggct gcccaccagg tccctgggga ctgtgctcca acaggcgctt 900
cgcgaggagc ccgagctgga aggtggggag acgcccaccg tgggcaccaa tgagatgggg 960
gcttga 966



<210> 25 



<211> 1002 



<212> DNA 
<213> Homo sapiens
<400> 25 

atggagaagg tggacatgaa tacatcacag gaacaaggtc tctgccagtt ctcagagaag 60 
tacaagcaag tctacctctc cctggcctac agtatcatct ttatcctagg gctgccacta 120 
aatggcactg tcttgtggca cttctggggc caaaccaagc gctggagctg tgccaccacc 180 
tatctggtga acctgatggt ggccgacctg ctttatgtgc tattgccctt cctcatcatc 240
acctactcac tagatgacag gtggcccttc ggggagctgc tctgcaagct ggtgcacttc 300 
ctgttctata tcaaccttta cggcagcatc ctgctgctga cctgcatctc tgtgcaccag 360 
ttcctaggtg tgtgccaccc actgtgttcg ctgccctacc ggacccgcag gcatgcctgg 420 
ctgggcacca gcaccacctg ggccctggtg gtcctccagc tgctgcccac actggccttc 480 
tcccacacgg actacatcaa tggccagatg atctggtatg acatgaccag ccaagagaat 540 
tttgatcggc tttttgccta cggcatagtt ctgacattgt ctggctttct ttccctcctt 600 
ggtcattttg gtgtgctatt cactgatggt caggagcctg atcaagccag aggagaacct 660 
catgaggaca ggcaacacag cccgagccag gtccatccgg accatcctac tggtgtgtgg 720 
cctcttcacc ctctgttttg tgcccttcca tatcactcgc tccttctacc tcaccatctg 780 
ctttctgctt tctcaggact gccagctctt gatggcagcc agtgtggcct acaagatatg 840 
gaggcctctg gtgagtgtga gcagctgcct caacccagtc ctgtactttc tttcaagggg 900 
ggcaaaaata gagtcaggct cctccagaaa ctgaggcaga acaagttggg tgagcatcca 960 
gctgggagga agagatgccc agggttgaac agatctgggt aa 1002 

<210> 26 
<211> 1527 
<212> DNA 

<213> Homo sapiens 
<400> 26 

atgacgtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac gtgcatgccc 60 
ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt gctggttatc 120 
ttcctcgccg cctctttcgt cggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180 
cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga cctgctgcag 240 
atttcgctcg tggccccctg ggtggtggcc acctctgtgc ctctcttctg gcccctcaac 300 
agccacttct gcacggccct ggttagcctc acccacctgt tcgccttcgc cagcgtcaac 360 
accattgtct tggtgtcagt ggatcgctac ttgtccatca tccaccctct ctcctacccg 420 
tccaagatga cccagcgccg cggttacctg ctcctctatg gcacctggat tgtggccatc 480 
ctgcagagca ctcctccact ctacggctgg ggccaggctg cctttgatga gcgcaatgct 540
ctctgctcca tgatctgggg ggccagcccc agctacacta ttctcagcgt ggtgtccttc 600 
atcgtcattc cactgattgt catgattgcc tgctactccg tggtgttctg tgcagcccgg 660 
aggcagcatg ctctgctgta caatgtcaag agacacagct tggaagtgcg agtcaaggac 720 
tgtgtggaga atgaggatga agagggagca gagaagaagg aggagttcca ggatgagagt 780 
gagtttcgcc gccagcatga aggtgaggtc aaggccaagg agggcagaat ggaagccaag 840 
gacggcagcc tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 900 
aggggcagcg aggaggtcag agagagcagc acggtggcca gcgacggcag catggagggt 960 
aaggaaggca gcaccaaagt tgaggagaac agcatgaagg cagacaaggg tcgcacagag 1020 
gtcaaccagt gcagcattga cttgggtgaa gatgacatgg agtttggtga agacgacatc 1080 
aatttcagtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 1140 
cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc tgctaaagtg 1200 
atcttcatca tcattttctc ctatgtgcta tccctggggc cctactgctt tttagcagtc 1260
ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 1320 
tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagacc 1380 
attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 1440 
gaagatagcc acccagacct gcccggaaca gagggtggga ctgaaggcaa gattgtccct 1500
tcctacgatt ctgctacttt tccttga 1527 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 27 

atggaggatc tctttagccc ctcaattc 28 

<210> 28 
<211> 28 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 28 

ctagaaggca ctttcgcagg agcaaggc 
<210> 29 

<211> 29
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 29 

atggagtcct cacccatccc ccagtcatc 

<210> 30 
<211> 29 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 30 

tcatgactcc agccggggtg aggcggcag 
<210> 31
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 31 

atgaaccaga ctttgaatag cagtgg 

<210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 32 

tcaagccccc atctcattgg tgcccacg 

<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 33 
atggagaagg tggacatgaa tacatcac

<210> 34 
<211> 29 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 34 

ttacccagat ctgttcaacc ctgggcatc 

<210> 35 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 35 

atgacgtcca cctgcaccaa cagcacgc 

<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence
<400> 36 

tcaaggaaaa gtagcagaat cgtaggaag 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 37 

ccaggagcgt ttctatgcct 

<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 38 

tgtgatcttt gctccctgca 

<210> 39 
<211> 28 
<212> DNA 

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (28)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 39 

tcagaacctg ccagcattga atagtgcc 


<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 40 

atctgctttg ccccgtatgt 

<210> 41 
<211> 20 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 41 

accgccttgc tgtaggtcag 

<210> 42 
<211> 22 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (22)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 42 

tcgtgccctt cgtcaccgtg aa 

<210> 43 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence

<400> 43 

cccagcatcc ataccagaaa a 

<210> 44 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence

<400> 44 

ctgtgtccct ctcatgccaa a 

<210> 45 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence

<220>

<221> misc_binding
<222> (1)
<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (28)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 45

tgagaaggca gagattccca tccttcct 

<210> 46 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 46 

tcgccatgag caacagcat 

<210> 47 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 47 

cactggactt accgccattg t 



> 
<210> 48
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (29)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 48 

agatcatgtt gctccactgg aaggcttct 

<210> 49 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 49 
ggatctcttt agcccctcaa ttc



<210> 50 
<211> 21
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 50 

aaggtcaggt tgagacccca g 

<210> 51 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized TaqMan probe sequence

<220>

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (25)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)
<400> 51

aacatttccg tgcccatctt gctgg 

<210> 52 
<211> 21
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 52 

gctgttgact ttcgaatccc a

<210> 53
<211> 23
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 53 

acggaggtag ctgtctgaca tga 

<210> 54 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: an artificially
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (26)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 54 

tgagttcctg gagcagcaac tcacca 

<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 55 

ggctttcgaa tgcacaggaa 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 56 

ggaagccatg ctgaagagga 

<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (28)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 57 

ttctgcatct atatcctcaa cctggcgg 

<210> 58 
<211> 21 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 58 

tggcctcttc accctctgtt t 

<210> 59 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 59 

atcaagagct ggcagtcctg a 

<210> 60 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (30)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 60 

tccatatcac tcgctccttc tacctcacca

<210> 61
<211> 19
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence

<400> 61 

ccaaaatgcc catcagcct 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized primer sequence 

<400> 62 

gcactatgtt gccgacgaaa

<210> 63 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially
synthesized TaqMan probe sequence

<220>

<221> misc_binding
<222> (1)

<223> Label FAM (6-carboxy-fluorescein)
<220>

<221> misc_binding

<222> (26)

<223> Label TAMRA

(6-carboxy-N,N,N',N'-tetramethylrhodamine)

<400> 63

catccgctca accgtgctgg ttatct
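
The length declared in each <211> field can be checked against the residues listed under <400>. The following is a minimal sketch in Python, given for illustration only and not part of the sequence listing itself; the names `count_residues`, `check_listing`, and `RECORDS` are hypothetical, and the three records are transcribed from SEQ ID NOs: 46, 47, and 63.

```python
# Illustrative check only; the function names below are hypothetical and
# are not part of the sequence listing.

def count_residues(residues: str) -> int:
    """Count nucleotide letters, ignoring the spaces that group bases in tens."""
    return sum(1 for ch in residues if ch.isalpha())


def check_listing(records: dict[int, tuple[int, str]]) -> dict[int, bool]:
    """Map each SEQ ID NO to True when the <211> length matches the residue count."""
    return {
        seq_id: count_residues(residues) == declared_length
        for seq_id, (declared_length, residues) in records.items()
    }


# SEQ ID NO -> (<211> length, <400> residues), transcribed from the listing.
RECORDS = {
    46: (19, "tcgccatgag caacagcat"),
    47: (21, "cactggactt accgccattg t"),
    63: (26, "catccgctca accgtgctgg ttatct"),
}

if __name__ == "__main__":
    for seq_id, ok in check_listing(RECORDS).items():
        print(f"SEQ ID NO: {seq_id} length {'matches' if ok else 'does not match'}")
```

For a TaqMan probe, the same count also corresponds to the position of the 3' TAMRA label given in the second <222> field, for example position 26 for SEQ ID NO: 63.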



Claims 


1. A DNA that encodes a guanosine triphosphate-binding protein-coupled receptor, wherein said DNA is selected 
from the group consisting of the following (a) to (d): 

(a) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17
to 21;

(b) a DNA comprising a coding region of the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26; 

(c) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17 
to 21 in which one or more amino acids are substituted, deleted, added, and/or inserted; and 

(d) a DNA hybridizing under stringent conditions to the DNA comprising the nucleotide sequence of any one
of SEQ ID NOs: 5 to 8 and 22 to 26.

2. A DNA encoding a partial peptide of a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 
to 4 and 17 to 21. 

3. A vector comprising the DNA of any one of claims 1 and 2.

4. A transformant carrying the DNA of any one of claims 1 and 2 or the vector of claim 3.

5. A protein or a peptide encoded by the DNA of any one of claims 1 and 2.

6. A method for producing the protein or the peptide of claim 5, said method comprising the steps of culturing the
transformant of claim 4 and recovering an expressed protein or peptide from the transformant or culture supernatant
thereof.

7. A method of screening for ligands that bind to the protein of claim 5, said method comprising the steps of: 

(a) contacting a test sample with the protein or the peptide of claim 5; and
(b) selecting compounds that bind to said protein or said peptide.

8. A method of screening for compounds that have activity of inhibiting the binding between the protein of claim 5 
and a ligand thereof, said method comprising the steps of: 

(a) contacting the protein of claim 5 or a partial peptide thereof with the ligand in the presence of a test sample
and detecting a binding activity of said protein or said partial peptide with said ligand; and

(b) selecting compounds that reduce the binding activity detected in step (a) as compared with a binding
activity detected in the absence of the test sample.

9. A method of screening for compounds that inhibit or enhance activity of the protein of claim 5, said method comprising
the steps of:

(a) contacting a ligand of said protein with cells expressing said protein in the presence of a test sample; 

(b) detecting an alteration in the cells that results from binding of said ligand to said protein; and 

(c) selecting compounds that suppress or enhance the alteration detected in step (b) as compared with an
alteration detected in the cells in the absence of the test sample.

10. The method of claim 8 or 9, wherein the alteration in cells is a change in cAMP concentration or calcium
concentration.

11. An antibody binding to the protein of claim 5. 

12. A compound isolated by the method of any one of claims 7 to 10.

13. A pharmaceutical composition comprising the compound of claim 12 as an active ingredient.

14. The pharmaceutical composition of claim 13, wherein said pharmaceutical composition is formulated for the treatment
of a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease.

15. A polynucleotide comprising at least 15 nucleotides, wherein said polynucleotide is complementary to the DNA
comprising the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26 or a complementary strand 
thereof. 

16. A method for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease,
said method comprising the steps of detecting expression of the DNA of claim 1 in tissues related to the disease
derived from a subject, or mutation in the DNA of claim 1 in the subject.

17. An agent for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease,
said agent comprising the antibody of claim 11 or the nucleotide of claim 15.

Figure 1

>sp|P47901|V1BR_HUMAN VASOPRESSIN V1B RECEPTOR (AVPR V1B) (VASOPRESSIN V3
RECEPTOR) (AVPR V3) (ANTIDIURETIC HORMONE RECEPTOR 1B).
Length = 424

Score = 316 (111.2 bits), Expect = 3.7e-41, Sum P(2) = 3.7e-41
Identities = 70/194 (36%), Positives = 115/194 (59%)



I + V GN VI ♦ + +K+SRM FV LA+TD I +1 + W T F 



PDL+CR V+YLQV+ 44ASTY+L+++++DRY A+ +P++ LQ Qf IHAHL 



♦ +FS+P +IFR+ -K5 + CWA D + V P Y+T ++ +P+T-H+ 

AAI FSLPQVFI FSLREVIQGSGVLDCWA — DFGFPWGPRAYLTWTTLAI FVLPVTMLTA 217 



Query: 56
Sbjct: 41
Query: 115
Sbjct: 101
Query: 174
Sbjct: 161
Query: 229
Sbjct: 218

Score = 131

Y ++ I +K KT 



Identities = 33/80 (41%), Positives = 47/80 (58%)

Query: 258 SSYNRGLISKAKIKAIKYSIIIILAFICCWSPYF — LFDILDNFNLLPDTQERFYASVI 314 

SS N IS+AKI+ +K ♦ -H+LA+I CWfP+F ♦+ + 0 N PO A I 
Sbjct: 267 SSI NT — ISRAKIRTVKMTFVIVLAYIACWAPFFSVQMWSVWDK-MA-PDEDSTNVAFTI 322 

Query: 315 IQNLPALNSAINPLIYCVFSSSI 337 

L LNS IIP IY F+S + 
Sbjct: 323 SMLLGNLNSCCNPWIYMGFNSHL 345
Figure 2

>sp|P31388|5H6_RAT 5-HYDROXYTRYPTAMINE 6 RECEPTOR (5-HT-6) (SEROTONIN RECEPTOR)
(ST-B17).
Length = 436

Score = 224 (78.9 bits), Expect = 6.7e-17, P = 6.7e-17
Identities = 84/309 (27%), Positives = 144/309 (46%)



Query: 3 PGEA — LLAGLLVMVLAVALLSNALVLLCCAYSAELRTRASGVLLVHLSLGHLLLAALDM 60 

PG 4- + A L V+++ A ++ U+L C A LR S LV+L L++ ♦ M 
Sbjct: 23 PGGSGWVAAALCW I VLTAAANSL L I VL I CTQPA-L RN-TSNFF L VSL RSDLMVGLVVM 80 

Query: 61 PFTLLGVMRGRTPSAPGACQVIGFLDTFLASNAALSVMLSADQWLAVGFPLRYAGRLR- 119 

P+L+GR-A6C + D S + L++ +S D++L + PLRY R+ 
Sbjct: 81 PPAMi. NAL YGRWVLARGLCL LWTAFOVMCGSAS I LN LCL I SLDRYLL I LSPLRYKLRMTA 140 

Query: 120 PRYAGLLLGCAWGQSLAFSGAALGCSWLGYSSAFASCSLRLPPEPERPRFAA — FTATL 176 

PR L+LG AW SLA AL S+L + P P ♦ R A F 

Sbjct: 141 PRALAL I LG-AW— SLA ALA-SFLPLLLGWHELGKARTPAPGQCRLLASLPFVLVA 192 

Query: 177 HAVGF VLPLAVLCLTSLQVHRVAR RHCQRMDTVT MKALALLADLHPSVR 225 

V F LP +C T ++ AR+ 1 1 ++ ^ + p ^ 

Sbjct: 193 SGVTFFLPSGAICFTYCRILLAARKQAVQVASLTTGTAGQALETLQVPRTPRPGMESADS 252 

Query: 226 ORCLIQQKRRRHRATRKIGIAIATFLICFAPYVNTRLAELVPF\rTVNAQWGILSKCLTYS 285 

+R + R+ +A+ -K5I + F + + P+ ♦ +A+ V ++L+ IY 

Sbjct: 253 RRLATKHSRKALKASLTLGILLGMFFVTWLPFFVANIAQAVCDCISPGLFDVLT-WLGYC 311 

Query: 286 KAVAOPFTYSL LRRPFRQVL 305
+ +P Y L R Ft+ L 

Sbjct: 312 HSTMNP 1 1 YPL FMRDFKRAL 331
Figure 3

>sp|P56479|GALR_MOUSE GALANIN RECEPTOR TYPE 1 (GAL1-R) (GALR1).
Length = 348

Score = 269 (94.7 bits), Expect = 7.9e-24, P = 7.9e-24
Identities = 82/289 (28%), Positives = 136/289 (47%)

Query: 49 VGFVGNLCVIGI LLHNAWKGKP-SMIHSL I LNLSLADLSLLIFSAPI RATAYSKSVWDLG 107 

««N VIH ♦ GKP S + ILNLS+ADL+ LLF P +AT Y+ W LG 
Sbjct: 46 MGVLGKSLVITVLARSK-PGKPRSnHLFILNLSIADLAYLLFCIPFQATVYALPTWVLG 104 

Query: 108 WFVCKSSDWF I HTCMAAKSLTI VWA — KVCFMYAS DPAKQVS IHNYT IWSVL VAI VTVA 165 

F4CK +F M T+ ++ + + S + + + + V IW ++ 
Sbjct: 105 AFICKFIHYFFTVSMLVSIFTLAAMSVDRYVAIVHSRRSSSLRVSRNALLGVGF-ItfALS 162 

Query: 166 SLLPLPEWFFSTIRHHEGVE-MCLVDVPAVAEEFMSMFGKLYPL— LAFG— LPLFFASF 220 

♦ P++H+. + C P +KY4 FGLPL F 

Sbjct: 164 IAMASPVAYHQRLFHRDSNQTFCWEQWPN KlHKKAYVVCTFVFGYLLPLLl I CF 217 

Query: 221 YFWRAYDQCKKRGTKTQHLRNQI RSKQVTVMLLSIAI ISAVIWLPEWVAWLWVWHLKAAG 280 

♦ ♦ + K+ K + ♦++ K+ H + +♦ + WLP V LW A 
Sbjct: 218 CYAKVLMHLHKK-LKNMSKKSEASKKKTAQTVLVWVVFGISWLPHHVVHLWAEF — GAF 274 

Query: 281 PAPPQGFI—ALSQVLMFSISSANPLIFLVMSEEFREGLKGVWKWmTKKPPTVSESQE 337 

P P F 4 L+SSS NP+I+ +SE FR+ K V+K ♦ + P SE++E 
Sbjct: 275 PLTPASFFFRITAHCLAYSNSSVNPI I YAFLSENFRKAYKQVFKCHVCDESPR-SETKE 332
Figure 4

>sp:NY2R_BOVIN NEUROPEPTIDE Y RECEPTOR TYPE 2 (NPY2-R).
Length = 384 

Score = 153 bits (383), Expect = 5e-37 

Identities = 93/308 (30%), Positives = 164/308 (53%), Gaps = 7/308 (2%) 

Query: 47 DEDED\fTNSRTFFAAKIVIGMALVGIMLVC6lGNFIFIAALVRYKKLRNLTNLLIANLAI 106 

D + ++ +S +4V+ +A I+L+ IGN + I 44++K +R +TN IANLA+ 
Sbjct: 38 DSEPELIDSTKLIEVQWLILAYCSIILLGVIGNSLVIHWIKFKSMRTVTNFFIANLAV 97 

Query: 107 SDFLVAIVCCPFEHDYYWRQLSWEHGHVLCTSVNYLRTVSLYVSTMALIAIAIORYLAI 166 

+0 LV- +C PF ♦ Y ♦ W+ G VLC V Y + ++4 VST L IA+DR+ I 
Sbjct: 98 ADLLVNTLCLPFTLTYTLMGE — WKMGPVLCHLVPYAQGLAVQVSTITLTVIALDRHRCI 155 

Query: 167 VHPL RPRMKCQTATGL I AL VWTVSI L I AI PSA YFTT ETVL VI VKSQEK I FCGQ1MP VDQQ 226 

VU H 0+ +1LWVSL+APAF +++ I* E + C + WP -H+ 
Sbjct: 156 VYHLESKISKQISFLI1GLAVDGVSALLASPLAIFREYSLIEIIPDFEIVACTEKWPGEEK 215 

Query: 227 -LYYKSYFLFI FGI EFVGPWTMTLCYAR ISRELWFKAVPGFQTEQI RKRLRCRRKTVLV 285 

+Y Y L I +V P+ +f Y RI +L PG + +R R+KT ♦ 
Sbjct: 216 GIYGTIYSLSSLLI LYVLPLGI ISFSYTRIWSKLKNHVSPGAAHDHYHQR — RQKTTKM 272 

Query: 286 LMCILTAYVLCWAPFTGFTIVROFFPTVFVKEKHYLTAFYIvECIAMSNSMlNTLCFVTV 345 

L+C++ + + WP + F+ 0 V+KY F ♦ IAM ++ N L + ♦ 
Sbjct: 273 LVCVVWFAVSWLPLHAFQLAVDIDSHV-LDLKEYKLIFTVFHIIAMCSTFANPLLYGVM 331 

Query: 346 KNDTVKYF 353 
++ K F 

Sbjct: 332 NSNYRKAF 339
Figure 5

>sp|P97926|OXYR_MOUSE OXYTOCIN RECEPTOR (OT-R).
Length = 388 

Score = 164 (57.7 bits), Expect = 8.9e-22, Sum P(2) = 8.9e-22 
Identities = 57/166 (34%), Positives = 84/166 (50%) 

Query: 24 WGLNLTLGQGAP ASGPPSR RVRLVFLGVI LWAVAGNTTVLCRLCGG6 71 

W ♦ L LG 6 P +GPP R RV 4 L +IL +A44GN VL L 
Sbjct: 9 WSIELDLGSGVPPGAEGNLTAGPPRRNEALARVEVAVLCLILFLALSGMACVLLAL 64 

Query: 72 GPWA6PKRRKMDFLLVQLALADLYACGGTALSQLAWELLGEPRAATGDLACRFLQLLQAS 131 

K 44 F 4 L++ADL L QL «+♦ R DL CR ++ LQ 

Sbjct: 65 -RTTRHKHSRLFFFMKHLSIADLWAVFQVLPQLLWDITF — RFYGPDLLCRLVKYLQW 121 

Query: 132 GRGASAHLWL I ALERRRAVRLPHGRPLPARA — LAALG-WLLALLLALPPAFV 182 

G AS +L++L++L+R A+ P . R L R LA L WL L4 44P 4 
Sbjct: 122 GMFASTYLLLLMSLDRCLAICQPL-RSLRRRTDRLAVLATWLGCLVASVPQVHI 174 

Score = 155 (54.6 bits), Expect = 8.9e-22, Sum P(2) = 8.9e-22 
Identities = 49/161 (30%), Positives = 85/161 (52%) 

Query: 217 CHGIFAPLPRVKHLQVYAFYEAVAGFVAPvTVLGVACGHLLS— WW— RHRPQAPAAAAP 272 

C +F 4 W + Y 4 4A ++ PV VL AC L+S 4W R + A AAAA 
Sbjct: 187 CWAVF — IQPWGPKAYVTWITLAVYIVPVIVLA-ACYGLISFKIWQNLRLKTAAAAAAAE 243 

Query: 273 WSASPG RAPAPSALPRAKVQSLKMSLLLALLFVGCELPYFAARLAAAWS-SG 323 

S 4 G R 4 + +AK4444KM+ 44 L F4 C P4F 44 4 W 4 • 

Sbjct: 244 GSDAAGGAGRAALARVSSVKL I SKAKI RTVKHTFI I VLAF I VCWTPFFFVCWWSVWDVNA 3Q3 

Query: 324 PAGDWEGEGLSAALRWAMANSALNPFVYLFFQAGXRLRRQLRKRLGSLCCA 376 

P E A4 44A NS NP44Y4 F L +L +R LCC4 
Sbjct: 304 PK — EASAFI I AM-LLASLNSCCNPWIYMLFTG HLFHELVQRF — LCCS 347
Figure 6

>sp|Q91178|GPRX_ORYLA PROBABLE G PROTEIN-COUPLED RECEPTOR (FRAGMENT).
Length = 428

Score = 823 (289.7 bits), Expect = 9.8e-83, P = 9.8e-83
Identities = 182/422 (43%), Positives = 266/422 (63%)

Query: 2 ESSPIPQSSGNSSTLGRVPQTPGPSTASGVPEVGL RDVASESVALFFMLLLDLTAV 57 

+4SP+ S + S P P+ P+V6+ + + LF H+ L+L A+ 

Sbjct: 5 KTSPMITSDHSI SNFSTGLFGPHPTVP — PDVGWTSSQSQMKDLFGLFCMVTLNLIAL 61 

Query: 58 AGNAAVMAVIAKTPALRKFVFVFHLCLVDLLAALTLMPLAMLSSSALFDHALFGEVACRL 117 

N VM IA+ P L+KF FV HLC VD+L A+ LMPL ++SSS F +F + C++ 
Sbjct: 62 LANTGVMVAI ARAPHLKKFAFVCHLCAVDVLCAI LLMPLGI I SSSPFFGTWFTI LECQV 121 

Query: 118 YLFLSVCFVSLAILSVSAINVERYYYWHPMRYEVRMTLGLVASVLVGVWVKALAMASVP 177 

Y+FL+V + L+IL-M4AI+VERY+Y+VHPMRYEV+MT+ LV V++ +W K-H. 4A V 
Sbjct: 122 Y I FLNVFL I WLS I LT 1 TAI SVERYFYI VHPMRYEVKMT I NLVI GVMLL I WFKSLLLALVT 181 

Query: 178 VLGRVSWEEGAPSVPPGCSLQWSHSAYCQLFVWFAVLYFLLPLLLILVVYCSMFRVARV 237 

♦ G + + CSL SHS +F V+F V+ FL Pwl VY ++++VAR 
Sbjct: 182 LFGWPPYGHQSS I AASHCSLHASHSRLRGVFAVL FCV1 CFLAPVWI FSVYSAVYKVARS 241 

Query: 238 AAMQHGP-LPTWME-TP-RQRSESLSSRSTMVTSSGAPQT-TPHRTFGGGKAAWLLAVG 293 

AA+Q P +PTW + +P + RS+S+4S++T++T+ PQ +P R F GGKAA+ t 4- 
Sbjct: 242 AALQQWAVPTWAOASPAKDRSDSINSQTTIITTRTLPQRLSPERAFSGGKAALTLAFIV 301 

Query: 294 GQFL LCWLPYFSFHLYVALSAQP I STGQVESWTWI GYFCFTSNPFFYGCLNRQI RGELS 353 

GQFL+CWLP+F FHL ++L+ S G +E V W+ Y F NP FY6 LNRQIR EL 
Sbjct : 302 GQFLVCWLPFFI FHLQMSLTGSMKSPGDLEEAVNWLAYSSFAVNPSFYGLLNRQI RDELV 361 

Query: 354 K-QFVCFFKPAPEEELRLPSREGSIEENFLQFLQGTGCPSESWVSRPLPSPKQ-EPPAVD 4.11 

K ♦ C +P E+ S EGS 4ENFLQF+0 T SE+ S +P+ E A 
Sbjct: 3S2 KFRRCCVTQPV E 1 GPSSLEGSFQENFLQF IQRTSSSSETHPSFANSNPRNMENQA — 416 

Query: 412 FRIPGQIAEE 421 
+IPGQI EE 

Sbjct: 417 HKIPGQIPEE 426
Figure 7

>sp|P23749|RTA_RAT PROBABLE G PROTEIN-COUPLED RECEPTOR RTA.
Length = 343

Score = 461 (162.3 bits), Expect = 2.3e-44, P = 2.3e-44
Identities = 121/323 (37%), Positives = 178/323 (55%)

Query: 2 NQTLNSSGTVESALNYSRGS-TVHT-AYL VLSSLAMRCLCGMAGNSMVIWLLGFR 55 

NQ G E+ YSRG T+ A L V ♦ + + CLCG+ GN +V+W GF 
Sbjct: 13 NQNKMCPGMSEALELYSRGFLT1EQIATLPPPAVTNYIFLLLCLCGLVGNGLVLWFFGFS 72 

Query: 56 MHRNPFCIYILNLAAADLLFLFSMASTLSLETQPLVKT-TDKVHELMKRLMYFAYTVGLS 114 

+ R PF IY L+LA4A0 ++LFS A L ++OV+ + + +G+S 
Sbjct: 73 1KRTPFSIYFLHLASADGIYLFSKAVIALLNMGTFLGSFPDYVRRVSRIVGLCTFFAGVS 132 

Query: 115 LLTAISTQRCLSVLFPIWFKCHRPRHLSAWVCGLLWTLCLLMNGLTSSFCSKFL— KFNE 172 

LL AIS -+RC+SV+FP+W+ RP+ LSA VC LLW L L+ + 4 FC Ft + ♦ 
Sbjct: 133 LLPAISIERCVSVIFP^1lm^RRRPKRLSAGVCALLWLLSFLVTSIHNYFCM-aGHEASG 191 

Query: 173 DRCFRVDMVQAALINGVLTPVmSSLUFVWVRRSSQQWRRQPTRLFVWLASVLVFLI 232 

C +0+ L+ + P+M L L L + V +++ R++ 4L WLA V VFL+ 
Sbjct: 192 TACLNMDI SLGI LLFFLFCPLMVLPCLAL I LNVECRARR-RQRSAKLNHWLAI VSVFLV 250 

Query: 233 CSLPLSIYWFYLYWL-SLPPEMQVLCFSLSRLSSSVSSSANPVIYFLVGSRRSHRLPTRS 291

S* L I WF L+W+ 4P ++ L ++SSA P++YFL G +S RL 

Sbjct: 251 SSIYLG1DWF-LFWVFQIPAPFPEY — VTDLCICI NSSAKPI VYFLAGRDKSQRL-WEP 305 

Query: 292 LGTVLQQALRE — EPELEGGETPTVGTNEM 319 

L V Q+ALR+ EP TP TEN 
Sbjct: 306 L RV VFQRAL RDGAEPGDAASSTPNTVTHEM 335
Figure 8

>sp|Q98907|P2Y3_CHICK P2Y PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE DIPHOSPHATE
RECEPTOR).
Length = 328

Score = 452 (159.1 bits), Expect = 2.0e-43, P = 2.0e-43
Identities = 85/185 (45%), Positives = 116/185 (62%)

Query: 15 CQFSEKYKQVYLSLAYSIIFILGLPLNGTVLWHFWGQTKRWSCATTYLVNLMVADLLYVL 74

C F E++KQV L L YS++F+LGLPLN V+ W K 4 T Y++NL +ADLLYV 
Sbjct: 13 CTFHEEFKQVLLPLVYSVVFLLGLPLNAWIGQIWLARKALTRTTIYMLNLAMADLLYVC 72

Query: 75 -LPFLUTYSLDDRWPFGELLCKLVHFLFYINLYGSILLLTCISVHQFLGVCHPLCSLPY 133 

LP LI Y+ D WPFG+ CK V F FY HL+GSIL LTCISV +4+G4CHPL S 
Sbjct: 73 SLPLLIYNYTOKDYWPFGDFTCKFVRFQFYTNLHGSILFI.TCISVQRYMGICHPLASWHK 132 

Query: 134 RT-RRHAWLGTSTTWALWLQLLPTLAFSHTDYIMGQMIWYDMTSQENFDRLFAYGIVLT 192 

♦ 4+ WL + W +V> Q LPT F+ T + + YD++ + F YGI LT 
Sbjct: 133 KKGKKLTWLVCAAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSTSYFPYGITLT 192 

Query: 193 LSGFL 197 
+«FL 

Sbjct: 193 ITGFL 197 
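
The percentages shown in the alignment headers of these figures follow from the reported match counts. The following is a minimal sketch in Python, given for illustration only; it assumes that each percentage is the ratio of the two counts truncated to a whole number (an assumption that agrees with the values printed in the figures, for example 85/185 reported as 45%), and the names `FIELD` and `header_percentages` are hypothetical.

```python
import re

# Matches "Identities = 85/185 (45%)" style fields in a BLAST-like header line.
FIELD = re.compile(r"(Identities|Positives)\s*=\s*(\d+)/(\d+)")


def header_percentages(header: str) -> dict[str, int]:
    """Return truncated integer percentages for each field found in the header."""
    return {
        name: int(matched) * 100 // int(aligned)
        for name, matched, aligned in FIELD.findall(header)
    }


if __name__ == "__main__":
    line = "Identities = 85/185 (45%), Positives = 116/185 (62%)"
    print(header_percentages(line))
```

Comparing the computed values with the figures printed in parentheses is one way to detect a miscopied count in a header line.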
Figure 9

>sp|O02824|A1AA_RABIT ALPHA-1A ADRENERGIC RECEPTOR (ALPHA 1A-ADRENOCEPTOR)
(ALPHA-1C ADRENERGIC RECEPTOR).
Length = 466

Score = 295 (103.8 bits), Expect = 1.0e-31, Sum P(2) = 1.0e-31
Identities = 66/215 (30%), Positives = 113/215 (52%) 

Query: 8 STRESNSSHTCMPLSKMPISLAHGMRSTVLVIFLAASFV6NIVLALVLQRKPQLLQVTN 67 

S S+SS+ P 4 P4444 1+ +L 4. 4GNI44 L 4 L VT4 
Sbjct: 5 SGNASDSSNCTHPPA — PVNISKAILLGVILGGLILFGVLGNILVILSVACHRHLHSVTH 62 

Query: 68 RFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVMTIVWSV 127 

♦I ML V DLL S V P+ 4 +W FC ++ L AS* 4+ V+S+ 
Sbjct: 63 YYI VNLAVADLLLTSTVLPFSAI FEI LGYWAFGRVFCNI WAAVDVLCCTASI ISLCVI SI 122 

Query: 128 DRYLSI I HPLSYPSKMTQRRGYLLLYGTWI VAI LQSTPPLYGWGQAAFDERNALCSM1WG 187 

DRY+ 4 4PL YP4 +TQRRG L W +++ S PL+GW Q A D4 4C 4 

Sbjct: 123 DRYIGVSYPLRYPTIVTQRRGLRALLCVWAFSLVISVGPLFGWRQPAPDDET-IOQI— M 179 

Query: 188 ASPSYTILSWSFIVIPLIVMIACYSWFCAARRQ 222 

P Y 4 S 4 4PL +44A Y V+ A+R+ 
Sbjct: 180 EEPGYVLFSALGSFYVPLTI ILAHYCRVYWAKRE 214 

Score = 106 (37.3 bits), Expect = 1.0e-31, Sum P(2) = 1.0e-31 
Identities = 23/75 (30%), Positives = 41/75 (54%) 

Query: 396 KAAKVIFIIIFSYVLSLGPYCFUVLAVWVDVETQVPQWITIIIWLFFLQCCIHPYVYG 455 

KAAK 4 1+4 4VL -P4— 4— ♦ ♦ + ♦ P+ V 14 WL 4L CI+P +Y 
Sbjct: 269 I^KTLGIWGCFVLCWLPFFLVMPIGSFFP-DFKPPETVFKIVFWLGYLNSCINPIIYP 327 

Query: 456 YMHKT I KKE I QDMLK 470 

4 KK Q44LK 
Sbjct: 328 CSSQEFKKAFONVLK 342
Figure 10

[Plot; horizontal axis: Amino acids (ticks at 1, 51, 101, 151, 201, 251, 301, 351)]

X64878

U82440

X93313

X87783

AF184966

AF147743

GPRv8

AE003754



MEGALAAN ffSA-EAA-NASAAPPGAEC — ' — NRTAGPPRRNEALARVEVAVLCL I L 

SSfif AN """ ,,ST " EAV " NSSAAPPCAEG NCTAGPPRRNEALARVEVAVLCLIL 

UEGLCLNL DCS-ELP-NSSIVNSSUENQMHSSNSTRDPLKRNEEVAKVEVTVLALIL 

UEEHFKEQDF-WSFNESSRNSTVGNETFGG NQTVNPLKRNEEVAKVEVTVLALVL 

MEKpCN , TLHp _ NGSDPFGRNEEVAQ I E I WLSITF 

^ii«c^nr^;;;: HGRIANQT7AS ~~~ NDTDpFCRHEEVAK||E, Tvi-svTF 

S^c«!5E;cIS!^! PPHRLLSLTNKS dpvcrperdeclaqveiavlcvif 

MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEVVEGKElGSFYTSFKTEQLITLffVLF 
UKCDHTLFFALFQTEQFAVLI I LF 

t • • 

TBI MtttttMt ttttittt TH2 MJIJttf 

LLALSCNACVLLALRTTRQKHSRLFFFIIKHLS I AOL VVA VFQ VLPQLLKD I TFRFYGPOL 
FLALSGNACVLLALRTTRHKHSRLFFFyKHLS I ADLVVAVFQVLPQLLIO I TFRFYGPOL 
FLALAGN ICVLLGI Yl N RHKHS RHYFF1IKHLS I ADL VVA I FQVLPQL I H> I TFRFYAPOL 
FLALAGNLCVL I A 1 YTAKHTQSRIIYYLMKHLS I ADL VVAVFQVLPQL I ID I TFRFYGPDF 
VVAVIGNVSVLLAMYNTKKKMSRMHLF I KHLSLADLVVAFFQVLPQLCIE I TYRFFGPDF 
FVAVIGNLSVLLAHHNTKKKSSRMHLFJKHLSLADHVVAFFQVLPQLCWE I TFRFYGPDF 
LTASVCNFILILVLVRRRKKLSRMYVFULHLSIADLVVAFFQVLPQLIWDITDVFiGPDF 
tuTui'Su* « ^culc?" ***** RMTFF VTQLA I TOSFTGL VN I LTD I NWRFTGDFTAPDL 
TVI VLGNSAVLFVMFINKNRKSRMNYFIKQLALADLCVGLLNVLTDI IWR ITI SIRAGNL 
** ::: - ....::♦.::*:« :. :: 

fSMIttflft TM3 Uttttfttt ttfttffttt TH4 t 

LCRLVKYLQVVGMFASTYLLLLMSLDRCLA I CQPLRSLRRRT — DRLAVLATILGCLVAS 
LCRL VK YLQ VVGHFAST.Y L LLLKSLD RCLA I CQPLRSLRRRT — DRLAVLATILGCLVAS 
VCRLVTYLQVVGMFASTYIiLLLHSLDRCLA I CQPLRSLHRRS — DCVYVLFTW ILSFLLS 

LCR I VKHLQVTGKF ASTYHMVUHTLDRY I A \ CHPLKTLQQPTQRSY IHI VSTWCSLVFS 
LCR I VKHLQVL GMF AST YMUVMIfT LORY I A I CHPLKTLQQPTQRAYMI GSTffLCSLLLS 
LCR 1 1 KYLQLLGMFASTYH I VVUTVDRYQAVCYPHVTFQKKRALIN I P I CTS1S I SLI LS 
ySlfy!SS!!!!^I A F YVLVSLS,DRYHA,VYPMKFL « ffiK ^ ARV L'VIAISLSFLFS 
ACKA I RFSQVC VTYS STYVLVAHS I DRYDA I THPMNFSKSWKR-ARHLVAGAIL I SALFS 



::♦**: 



• itutiittt tm5 fftttfttftift 

A ESX5 VF0ClrAVF 1 QP— "CPKAY I T» I TLAVY 1 VP V I VLATCYGL I S 

JKJ! "LREVADG- VFDMAVF I QP — IGPKAY I T8 I TUVY I VPV I VLAACYGL I S 
Hill "MP^C-^CRADFIQP-IGPKAYITilTUVYI IPVBILSVCYCLIS 
Hill S-K ' GMC — V ^CIGDFVQP~IGAICAY ITIISLTIYI IPVAI LGGCYGL I S 
TPQYF I FSLSEVKNGSTVKDCIAHF I EP — WGARAY I T8 1 TGGI FLVPVVI LVMCYGF I C 
TPQ YF I FSLSE I QNGS Y VYDCWCHF I EP-tG I RAY ITIITVGIFLIPVI 1LHI CYGF IC 
\ P rS! r . 1 Zl* ' E ' SPC "~ 1 FECBAEF 1 QP-tGPRAVVTi I LVV I FF I PST I L I TCQVK I C 
EThi.' C£227V5!!S — E VQC» ALWPDDS YWTP — Ytrr I VAFL VTF I PLT 1 1 S I MYG 8 V I 

LPILVLYEEKLIQGH PQCtlELCSPIAIQV-YMSLVSATLFAIPALI ISACYAI IV 

^ • • • • • • ▼ • • « 
Figure 12



*#•*•*«• IMS 

XB4878 FK I WQNLRLKTAAAAAAEAPEGAAAGDGGRVALARVSSVKL I SKAK I RTVKUTF 1 1 VLAF 

UB2440 FK I IQN L RIKTAAAAAAEAPEGAAAGOGGRUALARVS S VKL I SKAK I RTVKHTFI I VLAF 

XS3313 YK I IQN I RLKTVCESNLRLST SRRATLSRVSSVRL I SKAK I RTVKHTFI I VLAY 

X87783 FK I RQNFKRKTKKOQC I TLTTAA SKANALARVSSVKLVSXAK I TTVKMTFVIVLAY 

AF 1 84966 HT I RKN I K YKKRKT I PGAAS KNGL I GKNSVSSVTT I SRAKLRTVKHTFV I VLAY 

176321 HS 1 1KN I KCK-— T8RGTRNT KDCVIGKVSVSSVTI I SRAKLRTVKHTLVI VLAY 

AF 1 47743 K 1 1 KRN I YVKKQNEYQVTNQ KQVLPSRASSVNC I SKAM I KTVKHT I VTVVAY 

GPftvS RTIIIKSKTYETVISNCSOG KLCSSYNRCLISKAKIKAIKYSI II ILAF 

AE003754 KT I HAKCS I FVPTERAGFGA — APARRASSRGI I PRAKVKTVK1TLT I VFVF 

• • 

IStf*««*S«MMS8 ttffSStftfl TH7 tatltlftl 

X64876 I VC1TPFFF VQMSVWDANAPK — EASAF 1 1 VVLLASLNSCCNPI I YHLFTGHLFHELV 

U82440 IVCRTPFFFVQinSVIIOANAPK EASAF 1 1 VVLLASLNSCCNPI I YHLFTGHLFHELV 

193313 IVCWTPFFFVQHISV1DPNPPK — EASLFI I AHLLGSLNSCCNPI I YMLFTGHLFHDLL 

X87783 I VCtTPFFFVQWSAWPEAPR — EANPFI I SHLLASLNSCCNPf I YHFFAGHLFKDLK 

AF184966 1 1 CRAPFFT VQMISVWDENFQV ADSENT AVT I SALLASLNSCCNPI I TM I FSGHLLQDFH 

X76321 I VCt APFF I VQHRSWDENFSIDDSENAAVTLSALLASLNSCCNPI I YHLFSGHLLYDFL 

AF147743 VLCRSPFFIAQLWSVWFPSGIT EGSAFTI INLLGNLNSCTNP1I YHYFCGHIPY 

CPRvS I CCRSPYFLFD I LDNFNLLPDT-QERFY AS V 1 1 QNLPALNSA INPL I YCVFSSS I SFP~ 

AE0037S4 1 I CISP V 1 1 FDLLQVFGQ I PHS-QTN I A I ATF I QSLAPLNSAANPL I YCLFSSQVFRTLS 

: :: . : . * #*«««.: 



X64878

U82440

X93313

X87783

AF184966

X76321

AF147743

GPRv8

AE003754



-SSSFVLSHRSSSQRSCSQPS 
-SSSFVLSHRSSSQRSCSQPS 
-SSTFVLSRKSSSQKSITQPS 
-CSTYV I KSTSS-flRS I YQSS 
-LLTKUTN-RSPTGSTGNMD 
-LLTKLAACRMTNDGFGSMtD 
— I HLVD-ROPEENSTCA— 

-HEHQ I LS — KP-EF 

RFPPFKIFTCCCKSYRNNSQQNRCHTVGRRLHNSCOSMRTLTTSLTVSRRSTNKTIIARVV 
» : . 



«R- 
QS- 
QS— 
NC— 
RC— 



— FLCCSASVLKCRRLG — ETSASKKSN- 
— FLCCSASYLKGNRLG — ETSTSKKSN- 
— FLCCSARYLKTQQQGS-DLSASRKSN- 
— LLCCSTLYLKSSQCRCDQEHDSRKSN- 
— FAICRRANADFKKED — SDSS I RRTT- 
— FPCCXKPRNULQKED — SDSS I RRNT- 

CTNKQLENTSAQ — EDSVVTGS- 

; — CREQRSQDSRMT FRERTER- 



X64878

X93313

X87783

AF184966

X76321

AF147743

GPRv8

AE003754



TA- 
TA- 
TA- 

17- 



LDNSPK — TS I QUE 

PCHSRKSSQSIGLDCFCKSSQCLEHDCSRKSSQCIPLDCSRKSSQCIPLDCSRKSSQCIIS 



I CERPTKVVTVPAIISERRGVSLKGNTO I L~
Figure 13

[Plot; horizontal axis: Amino acid residues (ticks at 1, 51, 101, 151, 201, 251, 301, 351)]

Figure 14

tmtttttftt TH1 Itfittttltt I9i»lll#tl 'TH2 ftf 

GPRvIZ ORF MCPCEALLACLLVKVLAVALLSNALVLLCCAYSAELRTRASGVLLVNLSLCHLLLAALDM 
AF208288 MNSWDACLACLLVCT I GVSLLSNCLVLLCLLHSAD I RRQAPALFTLNLTCCNLLCTVVMI 

:* tttttt .:.*:«***.**«#* :**::* *:•« :.::* 

ititltU iltSlfSMSt TH3 IISIItlMtt 

CPRv12 ORF PF7LLGVIIRGRTPSAPGACQY I CFLDTFLASNAALSVAALSADQ1LAVCFPLRYACRLRP 

AF208288 PLTLAGVVAQRQPAGDRLCRLAAFLDTFLAANSMLSIIAALS I DR«*AVVFPLS YRAK1IRL 

t.tt tt: ♦ *:: .*«*****:«: $*:***« *:*;*• tt* * .::» 

ttllltSIt TH4 tltttttt f Itlllllft 

CPRv 1 2_ORF RYAGLLLCCAWGQSLAFSGAALCCSIILGYSSAFASCSLRLPPEPERPRFAAFTATLHAVG 
Af 208288 . RDAAEMVAYTILHALTFPATALALSILGFHQLYASCTLCSRRPDERLRFAVFTSAFHALS 
• «.:::. :* ::•:*..:**. ««: . ••«».••:::••:. 

TUS fltttltttli tt 

GPRv12_ORF FVLPLAVLCLTSLQVHRVARRHCQRHDTVTMKALALLAOLHPSVRQRCLIQQKRRRHRAT 
AF208288 FLLSF I VLCFTY LKVLKVARFHCKR I DV I TMQTLVLLVD I HPSVRERCLEEQKRRRQRAT 

: ***:» *:t ;ttt «:*:«.:**::*.♦*.«:***«:*«* :**«*:*** 

Iflttt TUB «f«tftftf»t MSttSMf TH7 ttfMttltt 

CPRv12 ORF • RK I G I A I ATFL I CFAPYVMTRLAELVPFVTVNAQIGI LSKCLTYSKAYADPFTYSLLRRP 
AF208288 KX I STF I GTFLYCFAPYV I TRLVELFSTAP IOSHIGVLSKCLAYSICAASDPF VYSLLRHQ 

*.**♦:******:***.*».. ..::::$*:*****:**«.:***.**»♦*: 



GPRv 1 2_ORF FRQVLAGUVHRLLKRTPRPASTHDSSLDVAGHVHQLLKRTPRPASTHHGSVDTENDSCLfl 

AF208288 YRRSCKELLNR I FNRR S IHSVGLTCDSHSQNILPVSE 

± * « • . .* • 



GPRv12_ORF QTH 
AF208288
Figure 16



******* 
tnuttt Til ##!##• 

1 MLAAAFADSN SSSMNVSFAH LHFAGGYLPS DSQDNRTI IP ALLVAVCLVC FVGNLCVIGI GO 

***************************************************************** 
tffttt TM2 Clttt 

61 LLHNAIKGKP SVIHSLILNL SLADLSLLLF SAP I RAT AYS KSVWDLGWFV CKSSDIFIHT 120 

************** **************************** 
Itiflf TM3 H«l*8ttl# lllft TM4 ##««««### 

121 CUAAKSLTIV VVAKVCFHYA SDPAKQVSIH NYTIWSVLVA IVTVASLLPL PEIFFSTIRH 180 

***************************************************************** 
8 tt$t»i9t TIIS ilStSftttfft 

181 HEGVEUCLVD VPAVAEEFMS HFGKLYPLLA FGLPLFFASF YFWRAYDQCK KRGTKTQNLR 240 

***************************************************************** 
tlttttlftl TMS «IHIttt#t Itllttlt TM7 ft 

241 NQIRSKQVTV MLLSIAIISA VLILPEWVAI LIVIHLKAAG PAPPQGFIAL SQVLNFSISS 300 

****** 
ittfttSttt 

301 ANPL I FLVMS EEFREGLKGV WKVMITKKPP TVSESQETPA GNSEGLPDKV PSPESPASIP 360 
361 EKEKPSSPSS GKGKTEKAEI PILPOVEQFW HERDTVPSVQ DNDP I PWEHE DQETGEGVK 419
Figure 17
Figure 18



GPRv21

AL121755

AF236082

U42766

U76254

U42389

U50144

D86238

M81490

AF037444



-METTNGFNDDNATNTSTSFLSVLNPHGAHA-TSFPFN 



HETTVGAIGENTTDTFTDFFSALDGHEAQT-GSIPFT 

MGP I GAEADENQTVEEHKVEQYGP 

~MGP IGAEADENQTVEEHKVEQYCP 



MGP I GAEADENQTVEEHKVEQYGP- 

-HKIIGPLGAEADEHQTVEEMKVDQFGPG- 



-MVLKNGPVGAEADE8-QTYEVKVEPYGPG- 



HYY I AHQQPHLRNEDDN VQEGYF I RPOPASL I YNTTALPADDEGSNYGYGSTT-TLSGLQ 
— MSIiANSENSTSLFGIKRHADVTGPKSASHDVIOPSNTSVYYDHASNYESVLSTTSTUi 

••tttfflfllt TMl 

FSYSDYDMPL DEDEDVTNSR TFFAAK I V I GMALVG I HLVCG I GNF I F 

— YGDYDLPH DEDEDKTKTR TFFAAK I V I G I AUG I HLVCG I GNFVF 

FSYGDYDHPL DEEEDVTNSR TFFAAK I V I GMALVG I HLVCG I GHF I F 

QTTPRGELVP DPEPEL I DST KL I EVQVVL I LAYCS 1 1 LLGV I GNSLV 

QTTPRGELVP DPEPEL I DST HCL I EVQVVL I LAYCS 1 1 LLGV I GNSLV 

QTTPRGELVP DPEPEL I DST KL I EVQVVL I LAYCS 1 1 LLGV I GNSLV 

KTTLPGELAP DSEPEL I DST -KL I EVQVVL I LAYCS 1 1 LLGV I GNSLV 

HTTPRGELPP DPEPEL I DST — KLVEVQV I L I LAYCS 1 1 LLGVVGNSLV 

FETYNI TVNUNFSCDDYDLLSEDH1 SSAYFK 1 1 VYHLY I P IF IFAL I GNGTV 

LKLTDLVTPFNASEPDPESNGSDTDGGHAA I SEQPMYAK V 1 1 VLHYVL 1 1 LVAVGGNLLF 

$ * " * I • C $ I I • 

llfllfl t»*l#t# TH2 tMMM Sfttttf 

I AALVRYKKLRNLTNLL I ANLA I SDFLVA I VCCPFEHDY YVVRQLSIEHGHVLCTS VNVL 
I AALTRYKKLRNLTNLL ! ANLA I SDFLVA 1 1 CCPFEHDY YVVRQLSWEHGHVLCASVNYL 
ITALARYKKLRNLTNLLIANLAISDFLVAIVCCPFEIIDYYVVRQLSIEHGHVLCASVNVL 
I HVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG— ENKHGPVLCHLVPVA 
IHVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG — EIKNGPVLCHLVPYA 
I HVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG — EIKNGPVLCHLVPYA 
I HVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG— EIKHGPVLCHLVPYA 
I HVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG — ENKMGPVLCHLVPVA 
CY I VYSTPRHRTVTNYF IASLAIGDI LNSFFCEPSSF I SLFI LN-YIPFGLALCHFVNVS 
SYV I VHYPKHRSVTNLFLLNLAI SD I VKAV I CNPFAF IANL1 LL-YIPYGEFNCQVVTYI 
:*.:** :: .**:.*:: .* * : .: * * :* * t 

•t TH3 tlllttlltf tttttfSIt TN4 IfSftftltf 

RTVSLY VSTNALLA I A I DRYLA I VHPLRPRNKCQTATGL I ALVNTVS I L I Al PSAYFTTE 
RTVSLYVST NALLA I A I DRYLA I VHPLKPRNNYQTASFL IALV1NVSILIAI PSAYFATE 
RTVSLYVSTNALLA I A I DRYLA I VHPLRPRNKCQTAAGL I FLVISVS I L I Al PAAYFTTE 
QGLAVQVST ITLTVI ALDRHRC I VYHLESK ISKRISFLI I GLAWG I SALLASPLA I FREY 
QGLAVQVST I TLTV I ALDRHRC I VTHLESK ISKRISFLI I GLAtC I SALLASPLA I FREY 
QGLAVQVST I TLTV I ALDRHRC I VTHLESK ISKRISFLI I GLGIR I SALLASPLA I FREY 
QGLAVQVST I TLTV I ALDRHRC I VYHLESK ISKQISFLI I GLANGVSALLASPLA I FREY 
QGLAVQVST I TLTV I ALDRHRC I VYHLESK I SKR ISFLI I GLAIG I SALLASPLA I FREY 
QAVSVLVSAYTLVA I S I DRY I A I NWPLKPR I TKRYATF 1 1 AGVIF I ALATALP I P I VSGL 
QVVAVFLSAFTLVAHSVDRYVAILKPHRPRLSKRAFAITHATIVILSLSAPLPTAITSRV 

• • ft • • * • . . • ft * • ft 

. • ~ • • **** • ■ • • • • ■ . » • ^ • • • ^ • 
######### TM5 #############

GPRv2 I TVLV I VKSQ— EK I FCGQ I IPVDQQ-LYYKSYFLF I FG I EFVGPVVTMTLCYAR I SRELI 

AL 1 2 1 755 TVLF I VKSQ — EKI FCGQ I IPVDQQ-LYYKSYFLF I FGVEFVGPVVTMTLCYAR I SRELI 

AF236082 TVLV I VERQ—EK I FCGQ I ■PVDQQ-FYYRSYFLLVFGLEFVGPVVAMTLCYARVSRELI 

U42766 SL IEI I PDF — EIVACTEKIPGEEKSI YGTVYSLSSLLILYVLPLGI ISFSYTRIISKLK 

U76254 SL 1 El I PDF — EIVACTEKIPGEEKSI YGTVYSLSSLLILYVLPLGI ISFSYTRIISKLK 

U42389 SL I E 1 1 PDF — EIVPCTEKIPAEEKSI YGTVYSLSSLLILYVLPLGI ISFSYTRIISKLK 

U50144 SLIEI I PDF— E I VACTEKIPGEEKGI YGT I YSLSSLL I LYVLPLGI ISFSYTRIISKLK 

085238 SLIEI I PDF— E I VACTEKIPGEEKSVYCTVYSLSTLL I LYVLPLGI ISFSYTRIISKLI 

N81490 D 1 PMSPIHTKCEKY I CREMIPSRSQ — EYYYTLSLFALQFVVPLGVLIFTYARITIRVI 

AF037444 TKQSNSTGL CLEHFENDHN — RYIYSIVIMILQYFVPLAVITVTNTHICYIVI 

I * * * | ■ • ■ • |* • • • • ■ 

llltfttll TH6 tlttttt 

CPRw21 FKAVPG-FQTEQ I RKRLRCRRKTVLVLHC I LTAYVLCIAPFYGFTIVRDFFPTVFVKEKH 

AL 1 2 1 755 FKAVPG-FQTEQ I RKRLRCRRKT VLVLIC I LTAYVLCRAPFYGFT I VRDFFPTVFVKEKH 

AF236082 FKAVPG-FQTEQ I RRTVRCRRRT VLGLVC VLSAYVLCIAPFYGFT I VRDFFPSVFVKEKH 

U42766 NHVSPG-AANDHYHQR — RQKTTKMLVCVVVVFAVSILPLHAFQLAVD- 1 DSQVLDLKE 

U762S4 SHVSPG-AANDHYHQR — RQKTTKMLVCVVVVFAVSILPLHAFQLAVD-I DSQVLDLKE 

U42389 NHVSPG-AANDHYHQR — RQKTTKML VC V VV VFAYSILPLHAFQLAVD- 1 DSQVLDLKE 

U50144 NHVSPG-AAHDHYHQR — RQKTTKNLVCVVVVFAVSWLPLHAFQLAVD-IDSHVLDLKE 

D86238 NHVSPG-AASDHYHQR — RHKVTXMLVCVVVVFAVSILPLHAFQLAVD-IDSHVLDLKE 

N81490 AKRPPGEAETNRDQRMARSKRKUVKMMLTVVIVFTCCILPFNILQLLLN— DEEFAHIDP 

AF037444 IKKTPGEAEEDRDRRHAASKRRLVKHI 1 1 VVV I YAVCILP VHV I TLVGD-HNPD I YNQPH 

• * • • • ••• * • • • * A £ . * , ■ 

• ^ W •••• • • * • . . t * s 

I 

######## TM7 #########

GPRv21 YLTAFY I VEC I ANSNSH I NTLCFVTVKNDTVKYFKK I ML— LHIKASYNGGKS 

AL121 755 YLTAFYVVEC I ANSNSttl NTVCFVTVKNNTMKYFKKHML LHIRPSQRGSKS 

AF23S082 YLTAFYVVEC I AISNSNI NTLCFVTVRNNTSKYLKR I LR LQIRASPSGSKA 

U42766 YKLIFTVFHI IAIICSTFANPLLYCWNSNYRKAFLSAFR CEQRLDAIHSEV 

U76254 YKLIFTVFYI IAMCSTFANPLLYCWNSNYRKAFLSAFR CEQRLDAIHSEV 

U42389 YKLIFTVFHI IAMCSTFANPLLYCWNSNYRKAFLSAFR CEQRLDAIHSEV 

U50144 YKLIFTVFHI I AMCSTFANPLLYCIUNSNYRKAFLSAFR CEQRLDAIHSEV 

086238 YKLIFTVFHI (AMCSTFANPLLYGIHNSNYRKAFLSAFR CEQRLDAIHSEV 

H81490 LPYVWFAFHILAIISHCCYNPI IYCYUNARFRSGFVQLHHRHPGLRRICCLRSVGORMNAT 

AF037444 VNVVRLCAQILAHSHSCYNPFVYFSLSATFRRNLRRIfTHACRLKQKR-LRQHLSllRSSRA 

•|| | 

GPR¥2I S ADLDLKT I CM — -PATEEVDC I RLK 

ALI2175S S ADLDLRTNGV — PTTEEVDCIRLK 

AF236082 S ADLDLRTTGI — PATEEVDC I RLK 

U42766 SVTFKAK KNLEVRKNSG — PNDSFTEATNV 

U76254 SVTFKAK KNLEVRKNSG — PNDSFTEATNV 

U42389 SVTFKAK KNLEVRKNSG — PNDSFTEATNV 

U50144 SVTFKAK KHLQVTKNNG — PNDSFTETTNV 

D88238 SMTFKAK KNLEVKKNNG — PTDSFSEATNV 

M81490 SGTCPALPLN— RMNTSTTY I SARRKPRATSLRANPLSCGETSPLR 

AF037444 DAIDRDTEVYGSAES I PSKVSAGSLHSSNRGAKHVNTSSGEIQCLKEKKLKGVSNDIiYL 
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Figure 21 



** 

###### TM1 ##

1 HEDLFSPSIL PPAPNISVPI LLGWGLNLTL CQCAPASGPP SRRVRLVFLC VILVVAVAGN 60 

tt*t****t*mM*mtmmmmmmm4m****tt*****tt*t***** 
#### ######## TM2 ##########

61 TTVLCRLCGG GGPWAGPKRR KHDFLLVQLA LADLYACGGT ALSQLAWELL GEPRAATGDL 120 

****************************m«mm*m*Mmm*t************ 
# ##### TM3 ######### ####### TM4 ####

121 ACRFLQLLQA SGRGASAHLV VI I ALERRRA VRLPHGRPLP ARALAALGWL LALLLALPPA 180 

* 

************** 

######### TM5

181 FVVRGDSPSP LPPPPPPTSL QP6APPAARA IPGQRRCHGI FAPLPRIHLQ VYAFYEAVAG 240 

************************M******************m********M********* 
iUUiUUt* ffllf 

241 FVAPVTVLGV ACGHLLSVW1 RHRPQAPAAA APISASPGRA PAPSALPRAK VQSLKMSLLL 300 

4************************4*************************)***** 
# TM6 ######### ####### TM7 ##########

301 ALLFVGCELP YFAARLAAAV SSGPAGDWEG EGLSAALRVV ANANSALNPF VYLFFQAGDC 360 

361 RLRRQLRKRL GSLCCAPQGG AEDEEGPRGH QALYRQRHPH PHYHHARREP LDEGGLRPPP 420 
421 PRPRPLPCSC ESAF 
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Figure 23 



HSH2R_1

D49783

M32701

U25440

S57565

S73473

M74716

U64032

L41147

GPRv47

D43633



MTFRDLLSVTFEGPRPDISAGGSGAGGGAGAGACAGOTASSESPAVGGVPGAAGGGGGGS 



■ NG- 

-MAP NG- 

11$ -NG- 

-MAF — -MG- 



•If 

-I TV 
-IT¥ 
-VSV 
-ISV 
-ISV 

UAPIPHKNGS LAF1SDAPTL0PSAANTSGLPG-VPI-AAAL 

iAPIPHKRGS LAFRSDAPTLDPSAANTSGLPG-VPI-AAAL 

WGACSGEDNRSSAGEPCCAGGGGE VHGTAAVCGLVVSAQSVG VGV 

-■VAA 



-HEP NG- 



-TAS SFCLDSTACK I T- 
-TAS SFCLDSTACK I T- 
-TGSSFCLDSPPCRIT- 
-TVPSFCUDFTVYKirT- 
-TVHSCCLDSHALKVT- 



-MVP EPGPT ANSTPAWGAGPPSAPGGSG- 



ItESSPI PQSSGNSSTLGRVPQTPGPSTASGVPEVGLRDV-ASESVALF 

MMADKTSP8I TSOHS I SNFSTGLFGPHPTVPPDVGVVTS5QSQMK0LFGLF 

### TM1 ###### ####### TM2 #####

VLAVL I L ITVAGNVVVCLAVCLNRRLRNLTNCFIVSLA I TDLLLCLL VLPFSA I YflLSCK 
VLA VL I L I TVAGNVVVCLAVGLNRRLRNLTNCFIVSLAI TOLLLCLLVLPFSA I YqLSOC 
VLTVLILIT I AGNV V VCL A VGLHR RLR SLTNCF I VS L A I TOLLLGLLVLPFSAF YDLSCR 
I L 1 1 L I LVTVAGItVyVCLAVGLNRRLRSLTNCF I VSLAVTDLLLCLL VLPFSA I YflLSdC 
VLTTL I L I T lAGNVVYCLAVSLNRRLRSLTNCF I V5LAATDLLLCLL VLPFSA I YQLSFT 
AGALLALAT VGCHLLV I TA I ARTPRLQT I TNVFVTSLATADLVVGLLVHPPGATULTGH 
AGALLALATVGGNLLVITAIARTPRLQTITNVFVTSLATADLVVGLLVypPGATLALTGH 
FLAAF I LTAVAGNLLV I ISVACHRHLQTVTH YF I VNLAVA0LLLSATVLPFSATIIEV1.GF 
ALCVVIALTAAANSLLIALICTQPALRNTSNFFLVSLFTSDLMVGLVVMPPAIILNALYGR 
FMLLLDLTAVAGNAAVMAV I AKTPALRKFVFVFHLCLVDLLAALTLMPLAMLSSSALFDH 
CMVTLNL I A LLANTGVMVA I ARAPHLKKFAF VCHLCA VOVLCA I LUIPLGI I SSSPFFGT 

### ######## TM3 ######### ##### TM4

■SFGKVFCN I YTSLDVMLCTAS I LNLFM I SLDRVCAVUDPLRYP VLVTPARVA I SLVl 1 1 
•SFCX VFCN I YTSLDVMLCTAS I LNLFMISLDRYCAVNDPLRYPVLVTPVRVA I SLWl 1 1 
■SFGKVFCN I VTSLOVMLCTAS I LNLFHISLORYCAVTOPIRYPVL 1 1PVRVAVSLVL 1 1 
VSFSKVFCN I YTSLOVMICTAS I LNLFMISLDR YCAVTDPLR YPVL 1 TPARVA I SLWti 
■SFGHVFCN I YTSLDVMLCTAS I LNLFM I SLDR YCAV TDPLRYP VLVTP VR VA I SLVF I f 
■PLCATGCELITSVDVLCVTASIETLCALAVORYLAVTNPLRVGTLVTKRRARAAVVLVI 
VPLCATGCELITSVDVLCVTASIETLCALAVDRVLAVTNPLRYGTLVTKRRARAAVVLVI 
WAFGRAFCD VRAAVDVLCCTAS I LSLCT I S VDR YVGVRHSLK YPA I MTERKAAA I LALLI 
IVLARGLCLLITAFDVMCCSAS ILNLCLI SLD8YLL I LSPLRYKLRHTPLRALALVLGAI 
ALFGEYACRIYLFLSVCFVSLAILSVSAIHVERYYYVVHP8RYEVRMTLGLVASVLVCV8 
V VFT I LECQVY I FLN VFL I ILS I LT I TA 1 5VERYFY I VHP1RYEVKMT I NLV I GVMLl 1 1 
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Figure 24

############ # TM5 #######



HSH2R I VISITLSFLSIHLCWN — SRNETSKCNHTTSKCN YQVIOYGLVOCLVTFYLPLLHICI 

D49783 VISITLSFLSIHLGWN — SRNETSKCNHTTSKCK — — VQVKVYCLVOCLVTFYLPLLIMCI 

M3270 1 V I S I TLSFLS 1 HLGWN — SRNETSSFNHT I PKCK VQVNL VYCLVDCLVTF YLPLLVKC I 

U25440 V I S I TLSFLS I HLGWN — SRNETSKDNDT I YKCK VQVNEVYGLVDGLVTF YLPLL 1 Kl 

S57 585 V I S I TLSFLS I HLGWN S RNCTRCGN-DTFKOC VQVNE VYGLVOCLVTF YLPLL I MCW 

$73473 I VSATVSFAP I NSQWIRVCADAEAQECHSNPRCCS FASNMPYALLSSSVSFYLPLLVMU 

N7471 8 I VSATVSFAP I NSQW1R VCADAEAOECHSN PRCCS F ASNW PYALLSSS VSFYLPLLVMLF 

U64032 AVALVVS-NGPLLCWK EPVPPD — ERFC GITEEVGYAVFSSLCSFYLPRAVIVV 

L41M7 SLAALASFLPLLLCWH ELCHARPPVPGQC RLLASLPFVLVASGLTFFLPSGAICF 

GPRv47 VKALANASVPVLCRVS-WEEGAPSVPPG CSLQWSHSAYCQLFVVVFAVLYFLLPLLLILV 

D43C33 FKSLLLA-LVTLFCWPPVCHQSS1AASH CS LHASHS ft LRGVFAVLFCV I CFLAPY VV I FS 

t : • * : 

HSH2R ! TYYRIFRVAROQAKRIO-HIS SWKAAT I ft 

049783 TYYR IFKVARDQAKR I H-H I S SWKAAT I ft 

H32 701 TYYR I FK I ARDQAKR I H-HMC SWKAAT » G 

U25440 TYFRI FK I AREQARR I N-tt IG SWKAAT I R 

S 57 565 TVYRIFKI AREQAKR I N-H • 5 SWKAAT I R 

S73473 VYARVFVVAKRQRRLLRRELCRFPPEESPRSPSRSPSPATVCTPTAS DCVPSCGR 

N747I6 VYARVFVVAKRQRRFVRRELCRFPPEESPRSPSRSPSPATVGTPTAS OCVPSCGR 

U64032 WYCRVYV VARSTTRSLEACVKRERGKASEVVLR IKCRGAASGAOGAFGTRGAKGHTFRSS 

L41 147 TYCR I LLAARKQAVQVASLTT6 MASQASETLQVPRTP— 8 — PG YES ADS 

CPRv47 VYCSVFRVARVAANQHCPLPTWHETP RQRSESLSSR— S TNVTSSGA 

D43633 VYSAVVKVARSAALQQVPAVPTWADAS PAKDRSDS INSQTT II TTRTLP 

♦ : : : 

######### TM6 ###### ########

HSH2R 1 EHRATVTLAAVNGAFI I CWFPYFTAFVYRCLRGODA I NEBLEA I YLWLGY 

049783 EHKATVTLAAVMGAF 1 1 CWFPYFTAFVYRCLRGODA INEVLEAI YL1LGY 

H327QI EHKATVTLAAVMGAF 1 1 CWFPYFTVFVYRGLKCODA INEAFEAVVL1L6V 

U25440 —EHKATVTLAAVMGAF I I CWFP YFT VFVYRGLKCDDAVNEVFEDYVLILCY 

S 5 7 5I S EHKATVTLAAVMGAF 1 1 CWFPYFTAFVYRCLRGODA INEAVEG I VLILGY 

$73473 RP ARLLPLG-EHR ALRTLCL I KG I FSLCWLPFFLANV LRALVGPSLVPSGVF I ALNVLGV 

M747 16 RPARLLPLC-EHRALRTLCL I NC I FSLCWLPFFLANVLRALVGPSLVPSCVF I ALN1LGV 

U64032 LSVRLUCFSREKKAAKTLA I VVGVFVLCWFPFFFVLPLCSLFPQLKPSECVFICVIFWLGV 

L41 147 RRLATKHSRKALKASLTLCI LLGMFFVTWLPFFVANI VQAVC — DC ISPCLFDVLTWLGY 

GPRV47 PQTTPHRTFCGGKAAVVLLAVGGQFLLCWLPYFSFHLYVALSAQPIST6RVESVWT1I6T 

D43633 QRLSPERAFSGGKAALTLAF I VGQFLVCWLPFF I FHLQM$LTG$MK$PQ)LEEAVN1LAY 

:t .$ : • • : . : t:.8 

TM7 ########

HSH2R 1 ANSALNP I LY AALNRDFRTCYQQLFCCRLANRNSHKTSLRSRASQLSRTQSREPR Q 

049783 ANSALNP I LYAALNRDFRTGYQQLFCCRLANRNSHKTSLRSNASQLSRTQSREPR— « 

M32701 ANSALNP I LYAUNRDFRTAYQQLFRCRPASHNAQETSLRSNSSQLARNQSREPM ft 

U25440 ANSALNP I LYAALNRDFRTAYHQLFCCRLASHNSHETSLRLNNSQLNRSQCQEPR i, 

SS758S ANSALNP I LYAALNRDFRTAYQQLFHCKFASHNSHKTSLRLNNSLLPRSQSREGR W 

S73473 ANSAFNPLIYCR-SPDFRDAFRRLL-CSVGGRGPEEP — RVVTFPASPVASR— 

M74716 ANSAFNPLIYCR-SPDFRDAFRRLL-CSYGGRGPEEP — RVVTFPASPVASR 

U64032 FNSCVNPLIYPCSSREFKRAFLRLLRCQCRRRRRRRPLVRVYGHHWRASAGGGPHPDCAL 

L41 147 CNSTHNP 1 1 VPLFMRDFKRALCRFLPCPRCPRERQAS-LASPSLRTSHSCPRPGLS 1 

GPRv47 FCFTSNPFFYGCLNRQIRGELSKQFVCFFKPAPEEELRLPSREGSIEENFLQ F 

D43633 SSFAVNPSF YGLLNRQ I RDELVKFRRCCVTQPVE I GP — SSLEGSFQENFLQ F 
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Figure 25 



HSH2R_1

D49783

M32701

U25440

S57565

S73473

M74716

U64032

L41147

GPRv47

D43633



QEEKPLKLQVISGTEVT 

QEEKPLKIQVISG7EV7APQGA7DR- 
QEEKPLKIQYRSCTEVTAPRGATDR- 
QEDKPLNLQV1SCTEVTAPQCATNR- 
QEEKPLKLQVISC7ELTHPQGNP I R- 
QNS-PLNR — FDGYEGERP-FPT — 
QNS-PLNR — FDGYEGERP-FPT 



SACAALPGAALALTAAPAPSSAAAPEGQAAGAGRRKPPCAFREVRLLGPLRRPTTQLRAK 
QQVLPLPIPPDSDSDSDAGSCGSSGLRLTAQLLLPGEATQDPPLPTRAAAAVNFFN | DPA 
LQGTGCPSESUVSRPLPSPKQEPPAVDFR I PGQ I AEETSEFLEQQLTSDI I VSDSYIRPA 
I QRTSSSSETHPSFANSNP-RNHENQAHK I PGQ J PEEQA 



VSSLSHX I RAGGAQRAEAACAlRSEVEAVAtSVARDVAEDNTCQAYELADYRNLRETDI 

EPELRPHPLG I PTN 

ASPRLES 



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r xyux e ^ / 





########## TM1 #


GPRvSI 


HMQTLHSSGTVESALHYSRGSTVHTAYLVLSSLAUFTCLCGIU 


M35297 


HAGNCSWEAHSTNQNKUCPGUSEALELYSRGFLT 1 EQ 1 ATLPPPAVTNY 1 FLLLCLCGLV 








###### ######## TM2 #########


GPRvSI 


GNSUVI VLLGFRMHRNPFC 1 Y 1 LNLAAADLLFLFSHASTLSLETOPLVN-TTMCVHFI V* 


M35297 


GNCLVLIFFGFS 1 KRTPFS 1 YFIHLASADG 1 YLFSKAV 1 ALLNMGTFLGSFPDYVRR VSR 








######## TM3 ######## ######## TM4 ########


GPRvSt 


RLtfYFAYTVGLSLLTAISTflRCLSVLFPifFKCHRPRHLSAiVCCLLfTICi 1 tfWHl ret 


H3S297 


1 VGLCTFFAGVSLLPA 1 S 1 ERCVSY 1 FPWYIRMPKRISAGVCAILIUSFLYTS 1 HNY 




m :tt:«:w:#: :**::m tt.tu . 




# ######### TM5 ########


GPRv51 


FCSKFLKFHE-DRCFRVOUVQAALIVGVLTPVinLSSLTLFVIVRRSSOOlRROPTRLPV 


■35297 


FCUFLGHEASGTACLNUD ISLGI LLFFLFCPLUVLPCLAL 1 IHVECRARftRQRS-AKLNH 








######## TM6 ######### ########## TM7 #######


CPRvSI 


VVLASVLVFL 1 CSLPLS 1 YIFVLYILSLPPEHOVLCFSt SRL SSSVSSSAMPV 1 YFI VM 


H35297 


VVLA 1 VSVFLVSS 1 YLG 1 DIFLF1 VFQ 1 P — APFPEY VTOLC 1 C 1 NS SAKP 1 VYFLAGR 




mttm:.t: : :; t, . :.m:*::m.* 


GPRvSI 


RRSHRLPTRSLGTVLQQALRE—EPELEGGETPTVGTNEIIGA- 


1135297 


ftK^OBI IPP-I DVVEflOAl DnriCDfTkAi C CTDUTUTUcunrocrUAr 
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Figure 28 
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Y14705 

AJ277752

AF031897

X99953

AF069555

X98283

D63665

GPRv71



######## TM1 ##

-■TSAESLLFTSLGPSPSSGDGDCRFNEEFKF I ILPWYAWFVLGUL 
-IITSADiLLFTSLGPSPSSGDCDCICFNEEFICFIlLPlSYAVYFVLGUL 
— MDAP VR«F3LAPiTPTPTPlLGGNTTAAAEAKCVFNEEFICF I LLP I SYG I VFWCLPL 
MTED I HATSYPTFITTPYLPHKLUNLTNDTED I CVFOEGFKFLLLPVSYSAVFMVGLPL 

VSUAHFTA — CR MSC7FQEEFICQVLLPLVYSVVFILCLPL 

HSMANFTG — CR— NSCTFHEEFKQVLLPLVYSVVFLLGLPL 

WE RDNGT 1 6APGLPP TTCV YREDFKRLLLPPVYSWLVVGLPL 

«EK VDiNTSQ— EQ-— — CLCBF5EK YKQV7LSLATS 1 1 F ILGLPL 

##### ###### TM2 ######## ####

NAPTLILFLFRLRPVDATATVIIFHLALSDTLYVLSLPTLVYVYAARNHIPFGTGICKFVR 
NAPTLWLFLFRLRPIDATATYMFHLAL5DTLYVLSLPTLVYYVAARNHRPFGTGFCKFVR 
NSiAUff I FVSRHRPVNATTTYyFNLAISDTLYVFSLPTLVVYYAORNNIPFGKVFCKIVR 
N I AAKI I F I AKMRPVNPTTVYNFNLAL5DTLYVLSLPTLVYYVADKNNIPFCEVICKLVR 
HAW I CQ I ILARKALTRTT I YMLNLATADLLVVCSLPLL I YNYTQKDYWPFCDFTCKFVR 
NAVV I CQ I ILARKALTR7T I YULNLAMAOLLYVCSLPLL I YNYTQKDYVPFGDFTCKFVR 
NVCVIAQICASRRTLTRSAVYTLNLALADLLYACSLPLIIYNYARGDHIPFGDLACRLVR 
NGTVLVHFVGQTKRISCATTYLVNLHVADLLYVL-LPFL 1 1 T YSLDDRIPFGELLCKLVH 

* •: : ::*.:•:»«. tt «: * : : mm 

# TM3 ######### ####### TM4 #####

FLFYW1L YCSVLFLTCI SVHRYLG ICHPLRA I RfGRPR-FASLLCLGVVLVVAGCLVPNL 
FLFVMLYCS VLFLTC I SVHR YMCI CHPLRA I RIGfiPR-FAGLLCLGYILVVAGCLVPML 
FLFVANL YSS I LFL TCISVHR YUGI CHP I RS LKIYKTK-HARL I CYGVll wr ICL I PNL 
FLFYANLYSS I LFLTCISVHRYRCVCHP I TSLRRIMAK-HAYV I CALmSVrLCLVPML 
FQFYTNLHGS I LFL7CI SVQRYNGI CHPUStMKKKGKKLTILYCAAYlTI VIAQCLPTF 
FQFYTNLHGS I LFLTCISVQRYHGICHPLASIHKKKGXKLTlLVCAAViTI VIAQCLPTF 
FLFYANLHGS I LFLTC I SFQRYLCI CNPLAPWXRGGRRAAIVVCGVVILVVTAQCLPTA 
FLFY INLYCSI LLLTC I SVHQFLGVCHPLCSLPYRTRR-HA1LGTSTTIALVVLQLLPTL 

* *♦ »:»:»»»«.:;: » »,»: . : : ; .* * : * # 

## ######### TM5 #######

FFVTTMAKG7T I LCHDTTLPEEFDHYVYFSSAVMVLLFCLPFL I TLVCYGUIARRLYRPL 
FFVTTNANG7T I LCHDTTLPEEFDHYVYFSST ( HVLLFGFPFL I TLVCYGLMARRLYRPL 
I FVTTSSKDNSTLCHDTTKPEEFDHYVHYSSS I MALLFGI PFLV I VVCYCLMAKRLCKRS 
I FVTVSPKVKNT I CHDTTRPEDFARYVEYSTA I MCLLFC IPCLI I AGCYCLHTRELMKPI 
VFASTGTO.RNRT VCYDLSPPDRSASYFP YC I TL7 1 TGFLLPFAA I UCYCSBAR I LCQKD 
VFASTGTORNRT VCYOLSPPDRSTSYFPYG I TLT » TGFLLPFAA I LACYCSMAR I LCQKD 
VFAATGK)RNRTVCYDLSPP I LSTRYLPYGHALTV IGFLLPFTALLACYCRMARRLCRQD 
AFSHTOY I NGQM I ■YDBTSQEHFDRLFA YC I VLTLSGFLSLLGHFGVLFTOGQEPDQARG 

* : 

######## TM6 ######### ##

PCAGQS SSRLRSLRTI AVVLTVFAVCFVPFHI TRTI YVQAR-LLQADCHVLN I VMWV 

PCAGQS SSRLRSLRT I AVVLTVFAVCFVPFHI TRT I Y YLAR-LLNAECRVLN J VNW 

FPSPSPRVPSYKKRSIKW 1 1 VLTVFA ICFVPFHi TRTLYf TSR-YFQADCQ7LNI IUFT 
VSGNQQTLPSVKKRS IKTI IFVMIAFAI CFMPFH I TRTLY VYAR-LIC I KCYALNV I NVT 
El I GLAVH-KKKDKAVRN 1 1 1 V V I VFS I SFFPFHLTKT I Y L I VRSSPTLPCPT LOAF A I A 
EL I CLAVH-KKKDKA VRM 1 1 1 V V I VPS 1 5FFPFHLTKT I YL I VRSSASLPCPTLQAFA I A 
CPAGPVAQ-ERRSKAARHAVVVAAVFVISFLPFHITKTAYLAVRSTPCVSCPVLETFAAA 
EPHEDRQHSPSOVHPDHPTGVIPLHPLFCALPYHSLLLPHHLLSAFSGLPALDGSQCGLQ 
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Figure 30 



Y14705

AJ277752

AF031897

X99953

AF069555

X98283

D63665

GPRv71



##### TM7 #########

YKVTRPLASANSCLDPVLYLFTGDKYRNQLQQLCRGSK — PKPR- 
YKVTRPLASANSCLDPVLYLFTCDKYRNQLQ.QLCRGST — PKRR- 
YK I TRPLAS I NSCLDP I LYFHAGDKYRCRLRRGAAQR P-R-- 



— TAASSL 
-TTASSL 
-PVPTSL 



YK VTRPLASANSC I DP I LYFLANDRYRRRL I RT VR R RS S VPNRRCHHTNHPQTEPHMTAG 

YKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKHS SKIRHD 

YKCTRPFASMNSVLDPILFYFTQRKFRESTRYLLDKHS SKHRQD 

YKG7RPFASANSVLDPILFYFTQQKFRRQPHDLLQKLT AKIQRQ 



OUEASGECEQLPQPSPVLSFKGGKNRVRLLQKLRQNKLG- 

* • • • ^ • ^ • 



-EHPAGRK 



ALVTLHEES ISRWADTHQDSTFSAYECDRl 

ALVTLHEES I SRIAD I HQDS I FPAYEGORL 

LALVSPSVDSSVVCSCCNSE SRGHGTVWSRGGQ- 



PLPV I SAEE I PSNGSMVRDENGEGSREHRVEWTDTKE I NQHUNRRST IKRNSTDKNOMKE 
HC I TYGS — 



HCISYGS- 
RV 



RCPGLNRSG- 



NRHGENYLPYVEVVEKEDYETKRENRKTTEQSSKTNAEQDELQTQIOSRLKRGKIQLSSK 



Y14705

AJ277752

AF031897

X99953 KGAAQENEKGKIIEPSFEGEGTSTVNLLTPKMYGKKDRLAKNVEEVGYGKEKELQNFPKA

AF069555

X98283

D63665 

GPRv71 
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Figure 32 



######### TM1 ########

U03866 MVFLSGNAS--DSSNCTQPPAP VN I SKA I LLGV I LGCLI LFGVLGN I LV 

L31774 MVFLSGNAS — DSSNCTQPPAP — 1 — VN I SKA I LLGV I LGGL I LFGVLGN I LV 

D25235 IIVFLSGNAS — DSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

D3 2 202 IIVFLSGNAS — DSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

D32201 IIVFLSGNAS — DSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

AF013261 IIVFLSGNAS — DSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

U81982 IIVFLSGNAS — DSSNCTHPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

U07126 MVLLSENAS — EGSNCTHPPAP VN I SKA I LLGV I LGCLI IFGVLGNILV 

S7 1 323 NVPVLDNMTPSSVTL — NCSNCSHVLAPE LNTVK AW LGMVLG I F I LF GV I GN I LV 

D63859 IITPSSVTL— NCSNCSHVLAPE — ; - LNTVKAVVLGllVLGI F I LFGV I GNILV 

AF091890 MSLNSSLSCRKELSNLTEEEGG ECGVI ITQFIAI IVlTlFVCLGNLVI 

GPR»72 MTSTCTNST — RESNSSHTCMPLSKMP I SLAHCI IRSTVLVIFLAASFVGNIVL 

. : tt :. : : : :»*::: 

### ######## TM2 ########## ######

U03B66 | LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGY1AFGRVFCN I WAAVDV 

L3 1 774 I LSVACHRHLHSVTHYY I VNLAVADLLLT ST VLPFSA I FEVLGYIAFGR VFCN I WAAVDV 

D25235 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLCYWAFGRVFCN I WAAVDV 

D32202 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I VAAVDV 

D3 2201 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I IAAVDV 

AF013261 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I WAAVDV 

UB 1 982 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FE I LCYWAFGRVFCN I IAAVDV 

UD71 26 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FE I LGYWAFGRVFCN I VAAVDV 

S7 1 323 I LSVVCHRHLQTVTYYF I VNLAVADLLLSSTVLPFSA I FE I LDRWVFGRVFCN I VAAVDV 

D63859 I LSVVCHRHLQTVTYYF I VNLAVADLLLSSTVLPFSA I FE I LDRWVFGRVFCN I VAAVDV 

AF091890 VVTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFVVTSS I RREWIFGVVWCIIFSALLVL 

GPRv72 ALVLQRKPQLLQVTNRF I FNLLVTDLLQ I SLVAPWVVATSVPLFWPLNSHFCTALVSLTH 

ft ft ft* ft * *& • 

# TM3 ######### ########## TM4 #########

U03866 LCCTAS I NGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVWALSLV I S I GPLFGVR- 

L31774 LCCTAS I NGLC MSI DRY I GVSYPLRYPT I VTQRRGLMALLCVWALSLV I S I GPLFGWR- 

D2 52 3 5 LCCTAS I NGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVWALSLV I S I GPLFGWR- 

D3 2202 LCCTAS I NGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVWALSLV I S I GPLFGVR- 

D3 2201 LCCTAS I NGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVVALSLV I S I GPLFGVR- 

AF01 326 1 LCCTAS I NGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVVALSLV I S I GPLFGVR- 

U81982 LCCTAS 1 1 SLCVI S IDRY I GVSYPLRYPT I VTQRRGLRALLCVWAFSLV I SVGPLFGVR- 

U07 1 26 LCCTAS I NGLC 1 1 S IDRY I GVSYPLRYPT I VTQRRG V RALLC VWVL SLV I S I GPLFGWR- 

S7 1 3 23 LCCTAS I MSLCV I SVDRY I GVSYPLR YPA I MTKRRALLAVMLLVVLSV 1 1 S I GPLFGWK- 

D638S9 LCCTAS I MSLCV I SVDRY I GVSYPLR YPA I MTKRRALLAVMLLVVLSV I ISIGPLFGWK- . 

AF091890 L I SSASMLTLGVI AIDRYYAVLYPMVYPMK ITGNRAVMALVY I VLHSLlGCLPPLFGVSS 

GPRv72 LFAFASVNT I VVVSVDRYLS I IHPLS YPSKMTQRRGYLLLYGTW I VA I LQSTPPLYGWGQ 
t . tt: .: ** :* * ::: . ♦*:** 
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Figure 33 



## # ######## TM5 ##########

U03866 QPAPEDET I CQ I NE — EPGYVLFSALCSFYLPLA 1 I LVMYCRVYVVAKRESRGLK S 

L31774 QPAPEDET I CQ I NE — EPGYVLFSALCSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

D25235 QPAPEDET I CQ I HE— EPGYVLFSALCSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

D32202 QPAPEDET I CQ I NE— EPGYVLFSALCSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

D32201 QPAPEDET ICQ I NE— EPGYVLFSALCSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

AFO 13261 QPAPEDET I CQ I NE — EPGYVLFSALCSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

U81982 QPAPDDET I CQ I NE — EPGYVLFSALCSFY VPLT 1 1 LAMYCRVYVVAKRESRGLK~?— S 

U07126 QPAPEDET I CQ I NE — EPGYVLFSALCSFY VPLA 1 1 LVMYCRVYVVAKRESRGLK S 

S71323 EP APEDETVCK I TE — EPGYA I FSA VCSFYLPLA 1 1 LAMYCRVYVVAQKESRGLK E 

D63859 EP APEDETVCK I TE — EPGYAIFSAVGSFYLPLAI I LAMYCRVYVVAQKESRGLK E 

AF091890 VEFDEFKWMCVAAWHREPGYTAFWQ I ICALFPFLVNLVCYCFI FRVARVKARKVH C 

GPRv72 AAFDERNALCSM I WGASPSYT I LSVVSF I V I PL I VII I ACYSVVFCAARRQHALLYNVKRH 

ftfltftt 

U038BS CLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVVG- 

L3 1 774 CLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGl VVG- 

D25235 GLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVVG* 

D32202 CLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVVG- 

D32201 GLKTDKSDSEQVTLR IHRKNAPAGGSGHASAKTKTHFSVRLLKFSREKKAAKTLGI VVG- 

AF0 13261 GLKTOKSDSEQVTLR IHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGI VVG- 

U81S82 GLKTDKSDSEQVTLR IHRKNAPAGGSGVASAKNKTHFSVRLLKFSREKKAAKTLGI VVG- 

U07126 GLKTDKSDSEQVTLR IHRKNVPAEGGGVSSAKNKTHFSVRLLKFSREKKAAKTLGIVVG- 

S7 1 323 GQK I EKSDSEQV I LRMHRGNTTVSED— EALRSRTHFALRLLKFSREKKAAKTLGI VVG- 

DC38S9 GQK I EKSDSEQV I LRMHRGNTTVSED — EALRSRTHFALRLLKFSREKKAAKTLGIVVG- 

AFOS1890 GTVVI VEEDAQRTG RKNSSTSTS— SSGSRRNAFQGVVYSANQCK-ALITI LVVLG- 

GPRV72 SLEVRVKDCVENEDEEGAEKKEEFQD — ESEFRRQHEGEVKAKEGRMEAKDGSLKAKEGS 

* . • • • . • • , • • ¥ 

####### TM6 ############### ########## TM7

U03866 CFVLC1LP FFLVMP I GSFFPD FKPSETVFK I VF1LGYLNSC I H 

L31774 CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFRLGYLNSC I N 

D25235 CFVLCILP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N- 

D32202 CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N 

D32 201 CFVLCILP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N 

AF0 13261 CFVLCILP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N 

US 1 982 CFVLCWLP FFLVMP I GSFFPD — FKPPETVFK I VFILGYLNSC IN 

U07 1 26 CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC IN 

S7 1 323 CFVLCWLP FFLVLP I CS I FPA YRPSDTVFK i TFWLGYFNSC I N 

063859 CFVLCWLP FFLVLP ICS I FPA — YRPSDTVFK I TFWLGYFNSC I N 

AF091890 AFMVTWCP YMVV I ASEALWGK — SSVSPSLETWATWLSFASAVCH 

GPRv72 TGTSESSVEARCSEEVRESSTVASDGSMEGKEGSTKVEENSMKADKGRTEVNQCSIDLGE 
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Figure 34 



ItlMllt 

U03866 -PI IYPCSSQEFK KAFQNVLR I QCLCRKQSSKH — ALCYT-LHPPSQAVECQHK- 

L3 1 774 -PI I YPCSSQEFK KAFQNVLR I QCLRRKQSSKH — AL6VT-LHPPSQAVEGQHK- 

D25235 -PI IYPCSSQEFK KAFQNVLR I QCLRRKQSSKH ALCYT-LHPPSQAVEGQHK- 

D32202 -P 1 1 YPCSSQEFK KAFQNVLR I QCLRRKQSSKH ALGYT-LHPPSQAVECQHX- 

D3220I -PI IYPCSSQEFK KAFQNVLR I QCLRRKQSSKH ALGYT-LHPPSQAVECQHK- 

AF013261 -PI IYPCSSQEFK KAFQNVLR I QCLCRKQSSKH ALGYT-LHPPSQAVEGQHK- 

U81982 -PI IYPCSSQEFK KAFQNVLK I QCLRRKQSSKH — ALCYT-LHAPSQALEGQHK- 

U07 1 26 -PI I YPCSSQEFK KAFQNVLR I QCLRRRQSSKH — ALGYT-LHPPSQALEGQHR- 

S71323 -P 1 1 YLCSNQEFK KAFQSLLCVHCLRMTPRAHHHHLSVGQSQTQGHSLTISLDSKG 

D63859 -PI I YLCSNQEFK KAFQSLLGVHCLRMTPRAHHHHLSVGQSQTQGHSLTISLDSKG 

AF091890 -PL I YGLVN KTVRKELLGMCFGDRYYREP — — FVQR — QRTSRLFS I SHR- 

GPRv72 DDUEFGEDD INFSEDDVEAVN I PESLPPSRRNSNSNP PLPRCYQCKAAKV I F 1 1 1 FS 



U038B6 DMVR I PVGSRETFYR I SKTDG— VCEMKFFSSMPRGSAR I TVSKDQS— SCTTARVKSKS 

L31774 DMYR I PVGSRETFYR I SKTDG — VCERKFFSSMPRGSAR I TVSKDQS — SCTTARVRSKS 

02523$ DMVR I PVGSRETFYR I SKTDC—VCENKFFSSMPRCSARITVSKDQS— SCTTARVRSKS 

032202 DMVR I PVGSRETFYR I SKTDG— VCEMKFFSSMPRGSARITVSKDQS— SCTTARTKSRS 

D32201 DMYR I PVGSRETFYR I SKTDG — VCEIKFFSSMPRGSARITVSKDQS — SCTTARGHTPM 

AF01 3281 DMVR I PVGSRETFYR I SKTDG — VCERKFFSSMPRGS AR I TVSKDQS — SCTTARRGMDC 

U8 1 982 DMVR I PVGSGETFYK I SKTDG— VCEWKFFSSMPRGSAR I TVPKDQS — ACTTARVRSKS 

U07126 DMVR 1 PVGSGETFYK I SKTDG — VCEWKFFSSMPQGSAR I TVPKDQS — ACTTARVRSKS 

S71323 APCRLSPSSSVALSRTPSSRO — SREVRVFSCGP I NSG— PGPTEAO— RAKVAKLCNKS 

D63859 APCRLSPSSSVALSRTPSSRO — SREVRVFSGGPINSG — PGPTEAC — RAKVAKLCNKS 

AF0918S0 - 1 TDLGLSPHLTALMAG GQPLGHS — SSTGDTG — FSCSQDSGH — 

GPRv72 YVLSLCPYCFLAVLAVIVDVETQVPQWVITI 1 1 VLFFLQCC I HPYVYCYMHXT I KKE I QD 



U03866 FLQVCCCVGPS-TPSLOKN — HQVPT I KVHT I SLSENGEEV : 

L31774 FLQVCCCVGPS-TPSLOKN— HQVPT I KVHT I SLSENGEEV 

02S235 FLEVCCCVGPS-TPSLOKN — HQVPT I KVHT I SLSENGEEV 

032202 VTRLECSG N I LAHCN — LRLPGSRDSPASASQAAGTTGOVPPGRRHQAQLIFVFLV 

D322Q1 . T 

AFO 13261 RYFTKNCR EH IKHVN — FMMPPVRKGLEC 

U8 1 982 FLQVCCCVGPS-TPNPGEN — HQVPT IKIKTI SLSENGEEV 

U07 1 26 FLQVCCCVGSS-APRPEEN — HQVPT IKIHTI SLCENCEEV > 

S7132S . LHRTCCC I LRARTPTQDPAPLGDLPT I K I HQLSLSEKGESV 

0638S9 LHRTCCC I LRARTPTQDPAPLGDLPT I K I HQLSLSEKGESV 

AF09I890 -LRAL : 

GPR¥72 MLKKFFCKEK — PPKEDSH — PDLPCTEGGTEGK I VPS YDSATFP 

U03B6C '• 

L31774 : 

D2S23S 

D32202 ETGFHHVGQDDLDLLTS 
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